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SECRETED PROTEINS AND POLYNUCLEOTIDES ENCODING THEM 

This application is a continuation-in-part of Ser. No. 60/XXX,XXX (converted to 
a provisional application from non-provisional application Ser. No. 08/845.296), filed 
April 25, 1997, which is incorporated by reference herein. 

FIELD OF THE IN VENTTON 
The present invention provides novel polynucleotides and proteins encoded by 
such polynucleotides, along with therapeutic, diagnostic and research utilities for these 
polynucleotides and proteins. 

B ACKGROI JND OF THF. INVENTION 
Technology aimed at the discovery of protein factors (including e.g., cytokines, 
such as lymphokines, interferons, CSFs and interleukins) has matured rapidly over the 
past decade. The now routine hybridization cloning and expression cloning techniques 
clone novel polynucleotides "directiy" in the sense that they rely on information directiy 
related to fl»e discovered protein (i.e., partial DNA/amino acid sequence of tiie protein in 
the case of hybridization cloning; activity of the protein in die case of expression cloning). 
More recent "indirect" cloning techniques such as signal sequence cloning, which isolates 
DNA sequences based on the presence of a now well-recognized secretory leader 
sequence motif, as well as various PCR-based or low stringency hybridization cloning 
techniques, have advanced the state of the art by making available large numbers of 
DNA/amino acid sequences for proteins that are known to have biological activity by 
virtue of their secreted nature in the case of leader sequence cloning, or by virtue of Uie 
cell or tissue source in the case of PCR-based techniques. It is to these proteins and the 
polynucleotides encoding Uiem that the present invention is directed. 
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SUMMARY OF THE INVENTION 
In one embodiment the present invention provides a composition comprising an 
isolated polynucleotide selected from the group consisting of: 
5 (a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:l; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:l from nucleotide 99 to nucleotide 902; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
1 0 NO:l from nucleotide 162 to nucleotide 902; 

(d) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:l from nucleotide 87 to nucleotide 219; 

(e) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of done ci25_4 deposited under accession number 

X5 ATCC 98415; 

(f) a poljrnucleotide encoding the full-length protein encoded by the 
cDNA insert of clone ci25_4 deposited under accession number ATCC 98415; 

(g) a polynucleotide comprising the nucleotide sequence of a mature 
protein coding sequence of clone ci25_4 deposited under accession number ATCC 

2 0 98415; 

(h) a polynucleotide encoding a mature protein encoded by the cDNA 
insert of clone ci25_4 deposited under accession number ATCC 98415; 

(i) a pol)mucleotide encoding a protein comprising the amino add 
sequence of SEQ ID NO:2; 

25 (j) a polynudeotide encoding a protein comprising a fragment of the 

amino add sequence of SEQ ID NO:2 having biological activity, the fragment 
comprising the amino add sequence from amino add 129 to amino add 138 of 
SEQIDNO:2; 

(k) a polynucleotide which is an allelic variant of a polynudeotide of 
30 (a)-(h) above; 

0) a polynudeotide which encodes a spedes homologue of the protein 
of (i) or (j) above ; and 

(m) a polynudeotide that hybridizes under stringent conditions to any 
one of the polynucleotides specified in (a)-(j). 
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Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
NOtl from nucleotide 99 to nucleotide 902; the nucleotide sequence of SEQ ID NO:l from 
nucleotide 162 to nucleotide 902; the nucleotide sequence of SEQ ID NO:l from 
nucleotide 87 to nucleotide 219; the nucleotide sequence of the full-length protein coding 
5 sequence of clone ci25_4 deposited under accession number ATCC 98415; or the 
nucleotide sequence of a mature protein coding sequence of done ci25_4 deposited under 
accession number ATCC 98415. In other preferred embodiments, the polynucleotide 
encodes the full-length or a mature protein encoded by the cDNA insert of done ci25_4 
deposited under accession nimiber ATCC 98415. 
1 0 Ottier embodiments provide the gene corresponding to the cDNA sequence of SEQ 

IDNOrl. 

In other embodiments, the present invention provides a composition comprising 
a protein, wherein said protein comprises an amino add sequence selected from the group 
consisting of: 

1 5 (a) the amino add sequence of SEQ ID NO:2; 

(b) fragments of the amino acid sequence of SEQ ID NO:2 comprising 
the amino add sequence from amino add 129 to amino add 138 of SEQ ID NO:2; 
and 

(c) the amino add sequence encoded by the cDNA insert of done 

2 0 ci25_4 deposited imder accession number ATCC 98415; 

the protein being substantially free from other mammalian proteins. Preferably such 
protein comprises the amino add sequence of SEQ ID NO:2. 

In one embodiment, the present invention provides a composition comprising an 
isolated polynudeotide selected from the group consisting of: 
25 (a) a polynudeotide comprising the nudeotide sequence of SEQ ID 

NaS; 

(b) a pol)mucleotide comprising the nudeotide sequence of SEQ ID 
NO:3 from nucleotide 283 to nucleotide 1158; 

(c) a polynudeotide comprising the nudeotide sequence of SEQ ID 

3 0 NO:3 from nudeotide 1 to nucleotide 789; 

(d) a polynucleotide comprising the nudeotide sequence of the full- 
length protein coding sequence of done da228_6 deposited imder accession 
number ATCC 98415; 
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(e) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone da228_6 deposited under accession number ATCC 98415; 

(f) a polynucleotide comprising the nucleotide sequence of a mature 
protein coding sequence of done da228_6 deposited under accession nimiber 

5 ATCC 98415; 

(g) a polynucleotide encoding a mature protein encoded by the cDNA 
insert of clone da228_6 deposited imder accession number ATCC 98415; 

(h) a polynucleotide encoding a protein comprising the amino add 
sequence of SEQ ID NO:4; 

10 (i) a polynudeotide encoding a protein comprising a fragment of the 

amino add sequence of SEQ ID NO:4 having biological activity, the fragment 
comprising the amino add sequence from amino add 141 to amino add 150 of 
SEQIDNO:4; 

(j) a polynucleotide which is an allelic variant of a polynudeotide of 
15 {a)-(g) above; 

(k) a polynudeotide which encodes a spedes homologue of the protein 
of (h) or (i) above ; and 

(1) a polynudeotide that hybridizes under stringent conditions to any 
one of the polynucleotides specified in (a)-(i). 

2 0 Preferably, such polynudeotide comprises the nucleotide sequence of SEQ ID 

NO:3 from nudeotide 283 to nudeotide 1158; the nudeotide sequence of SEQ ID NO:3 
from nudeotide 1 to nudeotide 789; the nucleotide sequence of the full-length protein 
coding sequence of clone da228_6 deposited under accession number ATCC 98415; or the 
nudeotide sequence of a mature protein coding sequence of done da228_6 deposited 
25 under accession number ATCC 98415. In other preferred embodiments, the 
polynudeotide encodes the full-length or a mature protein encoded by the cDNA insert 
of done da228_6 deposited under accession nimiber ATCC 98415. In yet other preferred 
embodiments, the present invention provides a polynudeotide encoding a protein 
comprising the amino add sequence of SEQ ID NO:4 from amino add 1 to amino add 169. 

3 0 Other embodiments provide the gene corresponding to the cDNA sequence of SEQ 

iDNas. 

In other embodiments, the present invention provides a composition comprising 
a protein, wherein said protein comprises an amino add sequence selected from the group 
consisting of: 
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(a) the amino add sequence of SEQ ID NO:4; 

(b) the amino add sequence of SEQ ID NO:4 from amino add 1 to 
amino add 169; 

(c) fragments of the amino add sequence of SEQ ID NO:4 comprising 
5 the amino add sequence from amino add 141 to amino add 150 of SEQ ID NO:4; 

and 

(d) the amino add sequence encoded by the cDNA insert of clone 
da228 6 deposited under accession number ATCC 98415; 

the protein bing substantially free from other mammalian proteins. Pteferably sud, 
10 proteincomprisestheaminoaddsequenceofSEQIDNO:4ortheaminoaddsequence 

of SEQ ID Na4 from amino add 1 to amino add 169. 

In one embodiment, the present invention provides a composition comprising an 
isolated polynudeotide selected from the group consisting of: 

(a) a polynudeotide comprising the nudeotide sequence of SEQ ID 

15 N05; 

(b) a polynucleotide comprising the nudeotide sequence of SEQ ID 
N05 from nucleotide 152 to nudeotide 2182; 

(c) a polynudeotide comprising the nudeotide sequence of SEQ ID 

N05 from nudeotide 2 to nudeotide 931; 
20 (d) a polynudeotide comprising ttie nudeotide sequence of the full- 

length protein coding sequence of done du410_5 deposited under accession 

number ATCC 98415; 

(e) a polynudeotide encoding the fuU-length protein encoded by the 
cDNA insert of done du410_5 deposited under accession number ATCC 98415; 
25 (f) a polynudeotide comprising the nucleotide sequence of a mahure 

protdn coding sequence of done du410_5 deposited under accession number 

ATCC 98415; 

(g) a polynudeotide encoding a mature protein encoded by the cDN A 
insert of clone du410_5 deposited under accession number ATCC 98415; 
30 (h) a polynudeotide encoding a protein comprising the amino add 

sequence of SEQ ID NO:6; 

(i) a polynudeotide encoding a protein comprising a fragment of the 
amino add sequence of SEQ ID Na6 having biological activity, the fragment 
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comprising the amino acid sequence from amino acid 333 to amino add 342 of 
SEQIDNa6; 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(aMg) above; 

5 (k) a polynucleotide which encodes a species homologue of the protein 

of (h) or (i) above ; and 

(1) a polynucleotide that hybridizes imder stringent conditions to any 
one of the polynucleotides specified in (a)-(i). 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
1 0 NO:5 from nucleotide 152 to nucleotide 2182; the nucleotide sequence of SEQ ID NO:5 
from nucleotide 2 to nucleotide 931; the nucleotide sequence of the full-lengti:\ protein 
coding sequence of done du410_5 deposited under accession number ATCC 98415; or the 
nucleotide sequence of a mature protein coding sequence of done du410_5 deposited 
under accession number ATCC 98415. In other preferred embodiments, tiie 
1 5 polynudeotide encodes the full-length or a mature protein encoded by the cDNA insert 
of clone du410_5 deposited imder accession number ATCC 98415. In yet other preferred 
embodiments, the present invention provides a polynudeotide encoding a protein 
comprising the amino add sequence of SEQ ID NO:6 from amino add 1 to amino add 260. 
Other embodiments provide tiie gene corresponding to the cDNA sequence of SEQ 
20 IDNaS. 

In other embodiments, the present invention provides a composition comprising 
a protein, wherein said protein comprises an amino add sequence selected from the group 
consisting of: 

(a) the amino add sequence of SEQ ID NO:6; 
25 (b) the amino add sequence of SEQ ID NO:6 from amino acid 1 to 

amino add 260; 

(c) fragments of the amino add sequence of SEQ ID N0:6 comprising 
the amino add sequence from amino add 333 to amino acid 342 of SEQ ID NO:6; 
and 

30 (d) the amino add sequence encoded by the cDNA insert of done 

du410_5 deposited under accession number ATCC 98415; 
the protein being substantially free from other mammalian proteins. Preferably such 
protein comprises the amino acid sequence of SEQ ID NO:6 or the amino add sequence 
of SEQ ID NO:6 from amino add 1 to amino add 260. 
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In one embodiment, the present invention provides a composition comprising an 
isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:7; 

5 (b) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:7 from nucleotide 51 to nudeohde 611; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:7 from nucleotide 1 to nucleotide 525; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
1 0 length protein coding sequence of done eh80_l deposited under accession number 

ATCC 98415; 

(e) a polynudeotide encoding the full-length protein encoded by the 
cDNA insert of done eh80_l deposited vmder accession number ATCC 98415; 

(f) a polynucleotide comprising the nucleotide sequence of a mature 
1 5 protein coding sequence of done eh80_l deposited under accession number ATCC 

98415; 

(g) a polynudeotide encoding a mature protein encoded by the cDNA 
insert of done eh80_l deposited tmder accession number ATCC 98415; 

(h) a polynucleotide encoding a protein comprising the amino add 

2 0 sequence of SEQ ID NO:8; 

(i) a polynudeotide encoding a protein comprising a fragment of the 
amino add sequence of SEQ ID NO:8 having biological activity, the fragment 
comprising the amino add sequence from amino acid 88 to amino add 97 of SEQ 
IDNO:8; 

25 (j) a polynudeotide which is an allelic variant of a polynudeotide of 

(aHg) above; 

(k) a polynudeotide which encodes a spedes homologue of the protein 
of (h) or (i) above ; and 

(1) a polynudeotide that hybridizes under stringent conditions to any 

3 0 one of the polynudeotides specified in (a)-{i). 

Preferably, such polynudeotide comprises the nudeotide sequence of SEQ ID 
NO:7 from nudeotide 51 to nudeotide 611; the nudeotide sequence of SEQ ID Na7 from 
nudeotide 1 to nudeotide 525; the nudeotide sequence of the full-length protein coding 
sequence of done eh80_l deposited under accession number ATCC 98415; or the 
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nucleotide sequence of a mature protein coding sequence of clone eh80_l deposited under 
accession number ATCC 98415. In other preferred embodiments, the polynucleotide 
encodes the full-length or a mature protein encoded by the cDNA insert of clone eh80_l 
deposited under accession number ATCC 98415. In yet other preferred embodiments, 
5 the present invention provides a polynucleotide encoding a protein comprising the amino 
add sequence of SEQ ID NO:8 from amino acid 1 to amino acid 158. 

Other embodiments provide the gene corresponding to the cDNA sequence of SEQ 
IDNa7. 

In other embodiments, the present invention provides a composition comprising 
10 a protein, wherein said protein comprises an amino add sequence selected from the group 
consisting of: 

(a) the amino add sequence of SEQ ID NO:8; 

(b) the amino add sequence of SEQ ID NO:8 from amino add 1 to 
cimino acid 158; 

1 5 (c) fragments of the amino add sequence of SEQ ID NO:8 comprising 

the amino add sequence from amino add 88 to amino add 97 of SEQ ID NO:8; and 

(d) the amino add sequence encoded by the cDNA insert of clone 
eh80_l deposited under accession number ATCC 98415; 

the protein being substantially free from other mammalian proteins. Preferably such 
2 0 protein comprises the amino add sequence of SEQ ID NO:8 or the amino add sequence 
of SEQ ID NO:8 from amino add 1 to amino acid 158. 

In one embodiment, the present invention provides a composition comprising an 
isolated polynucleotide selected from the group consisting of: 

(a) a polynudeotide comprising the nudeotide sequence of SEQ ID 

25 Na9; 

(b) a polynudeotide comprising the nucleotide sequence of SEQ ID 
NO:9 from nucleotide 431 to nudeotide 559; 

(c) a polynudeotide comprising the nucleotide sequence of SEQ ID 
NO:9 from nudeotide 518 to nucleotide 559; 

30 (d) a polynucleotide comprising the nudeotide sequence of SEQ ID 

NO:9 from nucleotide 190 to nudeotide 547; 

(e) a polynudeotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone er369_l deposited under accession 
number ATCC 98415; 
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(f) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone er369_l deposited under accession number ATCC 98415; 

(g) a polynucleotide comprising the nucleotide sequence of a mature 
protein coding sequence of done er369_l deposited under accession number 

5 ATCC 98415; 

(h) a polynucleotide encoding a mature protein encoded by the cDNA 
insert of clone er369_l deposited under accession number ATCC 98415; 

(i) a polynucleotide encoding a protein comprising the amino add 
sequence of SEQ ID NaiO; 

10 (j) a polynudeotide encoding a protein comprising a fragment of the 

amino add sequence of SEQ ID NO:10 having biological activity, the fragment 
comprising the amino add sequence from ammo add 16 to amino acid 25 of SEQ 
IDNOrlO; 

(k) a polynucleotide which is an allelic variant of a polynucleotide of 

15 (a)-(h) above; 

(1) a polynudeotide which encodes a spedes homologue of the protein 

of (i) or (j) above ; and 

(m) a polynudeotide that hybridizes under stringent conditions to any 
one of the polynudeotides spedfied in (a)-(j). 
20 Preferably, such polynudeotide comprises \he nucleotide sequence of SEQ ID 

Na9 from nudeotide 431 to nudeotide 559; the nucleotide sequence of SEQ ID NO:9 from 
nucleotide 518 to nudeotide 559; tiie nudeotide sequence of SEQ ID NO:9 from 
nucleotide 190 to nucleotide 547; the nudeotide sequence of the full-length protein coding 
sequence of done er369_l deposited imder accession number ATCC 98415; or the 
25 nudeotide sequence of a mature protein coding sequence of done er369_l deposited 
under accession number ATCC 98415. In other preferred embodiments, the 
polynudeotide encodes the full-lengtii or a mature protein encoded by the cDNA insert 
of done er369_l deposited under accession number ATCC 98415. In yet otiier preferred 
embodiments, the present invention provides a polynudeotide encoding a protein 
3 0 comprising the amino add sequence of SEQ ID NO:10 from amino add 1 to amino add 
39. 

Other embodiments provide the gene corresponding to the cDN A sequence of SEQ 
IDNO:9. 
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In other embodiments, the present invention provides a composition comprising 
a protein, wherein said protein comprises an amino add sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NOlO; 
5 (b) the amino acid sequence of SEQ ID NO:10 from amino add 1 to 

amino add 39; 

(c) fragments of the amino add sequence of SEQ ID NO:10 comprising 
the amino add sequence from amino add 16 to amino add 25 of SEQ ID NOlO; 
and 

10 (d) the amino add sequence encoded by the cDNA insert of clone 

er369_l deposited under accession number ATCC 98415; 

the protein being substantially free from other mammalian proteins. Preferably such 

protein comprises the amino add sequence of SEQ ID NO:10 or the amino add sequence 

of SEQ ID NO:10 from amino acid 1 to amino add 39. 
15 In one embodiment, the present invention provides a composition comprising an 

isolated polynudeotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NQll; 

(b) a polynudeotide comprising the nucleotide sequence of SEQ ID 
2 0 NO:ll from nudeotide 91 to nucleotide 2838; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:ll from nudeotide 2209 to nucleotide 2838; 

(d) a poljmudeotide comprising the nucleotide sequence of SEQ ID 
NO:ll from nudeotide 839 to nucleotide 1197; 

25 (e) a polynudeotide comprising the nudeotide sequence of the full- 

length protein coding sequence of done fhl23_5 deposited imder accession 
number ATCC 98415; 

(f) a polynudeotide encoding the full-length protein encoded by the 
cDNA insert of clone fhl23_5 deposited imder accession number ATCC 98415; 

30 (g) a polynudeotide comprising the nucleotide sequence of a mature 

protein coding sequence of done fhl23_5 deposited under accession number 
ATCC 98415; 

(h) a polynudeotide encoding a mature protein encoded by the cDNA 
insert of clone fhl23_5 deposited under accession number ATCC 98415; 



10 



wo 98/49302 



PCT/US98/08336 



(i) a polynucleotide encoding a protein comprising the amino add 
sequence of SEQ ID NO:12; 

(j) a pol)mudeotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO: 12 having biological activity, the fragment 
5 comprising the amino add sequence from amino add 453 to amino acid 462 of 

SEQIDNO:12; 

(k) a pK)l5mudeotide which is an allelic variant of a polynudeotide of 
(a)-(h) above; 

0) a polynudeotide which encodes a spedes homologue of the protein 
10 of (i) or Q) above ; and 

(m) a polynudeotide that hybridizes under stringent conditions to any 
one of the polynudeotides spedfied in (a)-{j). 

Preferably, such polynucleotide comprises the nudeotide sequence of SEQ ID 
NO:ll from nudeotide 91 to nudeotide 2838; the nucleotide sequence of SEQ ID NOll 

1 5 from nudeotide 2209 to nudeotide 2838; the nudeotide sequence of SEQ ID NOrll from 
nucleotide 839 to nudeotide 1197; the nudeotide sequence of the full-length protein 
coding sequence of done fhl23_5 deposited under accession number ATCC 98415; or the 
nudeotide sequence of a mature protein coding sequence of clone fhl23_5 deposited 
under accession nimiber ATCC 98415. In other preferred embodiments, the 

2 0 polynudeotide encodes the full-length or a mature protein encoded by the cDNA insert 
of clone fhl23_5 deposited under accession number ATCC 98415. In yet other preferred 
embodiments, the present invention provides a polynudeotide encoding a protein 
comprising the amino add sequence of SEQ ID NO:12 from amino add 251 to amino add 
369. 

2 5 Other embodiments provide the gene corresponding to the cDNA sequence of SEQ 

ID Nail. 

In other embodiments, the present invention provides a composition comprising 
a protein, wherein said protein comprises an amino add sequence selected from the group 
consisting of: 

3 0 (a) the amino add sequence of SEQ ID NO:12; 

(b) the amino add sequence of SEQ ID NO:12 from amino acid 251 to 
amino add 369; 
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(c) fragments of the amino add sequence of SEQ ID NO:12 comprising 
the amino add sequence from amino add 453 to amino add 462 of SEQ ID NO:12; 
and 

(d) the amino add sequence encoded by the cDNA insert of done 
5 fhl23_5 deposited under accession number ATCC 98415; 

the protein being substantially free from other mammalian proteins. Preferably such 
protein comprises the anuno add sequence of SEQ ID NO:12 or the amino add sequence 
of SEQ ID NO:12 from amino add 251 to amino add 369. 

In one embodiment, the present invention provides a composition comprising an 
1 0 isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nudeotide sequence of SEQ ID 

Nai3; 

(b) a polynudeotide comprising the nudeotide sequence of SEQ ID 
NO:13 from nudeotide 568 to nudeotide 978; 

15 (c) a polynucleotide comprising the nudeotide sequence of SEQ ID 

NO:13 from nudeotide 1084 to nucleotide 1854; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of done fm60_l deposited under accession 
number ATCC 98415; 

20 (e) a polynudeotide encoding the full-length protein encoded by the 

cDNA insert of clone fm60_l deposited imder accession number ATCC 98415; 

(f) a polynucleotide comprising the nucleotide sequence of a mature 
protein coding sequence of done fm60_l deposited imder accession number 
ATCC 98415; 

25 (g) a polynudeotide encoding a mature protein encoded by the cDNA 

insert of clone fm60_l deposited imder accession number ATCC 98415; 

(h) a polynudeotide encoding a protein comprising the amino add 
sequence of SEQ ID NO:14; 

(i) a polynudeotide encoding a protein comprising a fragment of the 
3 0 amino add sequence of SEQ ID NO:14 having biological activity, the fragment 

comprising the amino add sequence from amino acid 63 to amino add 72 of SEQ 
IDNO:14; 

(j) a polynucleotide which is an allelic variant of a polynudeotide of 
(aHg) above; 
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(k) a polynucleotide which encodes a spedes homologue of the protein 
of (h) or (i) above ; and 

(1) a polynucleotide that hybridizes under stringent conditions to any 
one of the polynucleotides specified in (a)-(i)- 
5 Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 

NO:13 from nucleotide 568 to nucleotide 978; the nucleotide sequence of SEQ ID NO:l3 
from nucleotide 1084 to nucleotide 1854; the nucleotide sequence of the full-length protein 
coding sequence of done fin60_l deposited under accession number ATCC 98415; or the 
nudeotide sequence of a mature protein coding sequence of done fm60_l deposited under 
10 accession number ATCC 98415. In other preferred embodiments, the polynucleotide 
encodes the full-length or a mature protein encoded by the cDNA insert of done fm60_l 
deposited under accession number ATCC 98415. 

Other embodiments provide the gene corresponding to the cDNA sequence of SEQ 
IDNOria. 

15 In other embodiments, the present invention provides a composition comprising 

a protein, wherein said protein comprises an amino add sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO:14; 

(b) fragments of the amino add sequence of SEQ ID NO:14 comprising 
20 the amino add sequence from amino add 63 to amino add 72 of SEQ ID NO:14; 

and 

(c) the amino add sequence encoded by the cDNA insert of done 
fm60_l deposited tmder accession ntimber ATCC 98415; 

the protein being substantially free from other mammalian proteins. Preferably such 
2 5 protein comprises the amino add sequence of SEQ ID NO: 14. 

In one embodiment, the present invention provides a composition comprising an 
isolated polynucleotide selected from the group consisting of: 

(a) a polynudeotide comprising the nudeotide sequence of SEQ ID 

Nai5; 

30 (b) a polynudeotide comprising the nudeotide sequence of SEQ ID 

NO:15 from nudeotide 16 to nucleotide 309; 

(c) a polynudeotide comprising the nudeotide sequence of SEQ ID 
NO:15 from nudeotide 127 to nudeotide 309; 
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(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of done fr473_2 deposited under accession 
number ATCC 98415; 

(e) a polynucleotide encoding the full-length protein encoded by the 
5 cDNA insert of clone fr473_2 deposited imder accession number ATCC 98415; 

(f) a polynucleotide comprising the nucleotide sequence of a mature 
protein coding sequence of clone fr473_2 deposited under accession number 
ATCC 98415; 

(g) a polynucleotide encoding a mature protein encoded by the cDNA 
1 0 insert of clone fr473_2 deposited under accession number ATCC 98415; 

(h) a polynucleotide encoding a protein comprising the amino add 
sequence of SEQ ID NO:16; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino add sequence of SEQ ID NO:16 having biological activity, the fragment 

15 comprising the amino add sequence from amino add 44 to amino add 53 of SEQ 

IDNO:16; 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(g) above; 

(k) a polynucleotide which encodes a spedes homologue of the protein 
20 of (h) or (i) above ; and 

(1) a polynudeotide that hybridizes under stringent conditions to any 
one of the polynudeotides specified in (a)-(i). 

Preferably, such polynudeotide comprises the nucleotide sequence of SEQ ID 
NO:15 from nudeotide 16 to nudeotide 309; the nucleotide sequence of SEQ ID NO:15 

2 5 from nudeotide 127 to nucleotide 309; the nucleotide sequence of the full-length protein 
coding sequence of done fr473_2 deposited imder accession number ATCC 98415; or the 
nucleotide sequence of a mature protein coding sequence of done fr473_2 deposited 
under accession number ATCC 98415. In other preferred embodiments, the 
polynudeotide encodes the full-length or a mature protein encoded by the cDNA insert 

30 of done fr473_2 deposited under accession number ATCC 98415. In yet other preferred 
embodiments, the present invention provides a polynudeotide encoding a protein 
comprising the amino add sequence of SEQ ID NO:16 from amino add 1 to amino add 
58. 
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Other embodiments provide the gene corresponding to the cDNA sequence of SEQ 
IDNai5. 

In other embodiments, the present invention provides a composition comprising 
a protein, wherein said protein comprises an amino acid sequence selected from the group 
5 consisting of: 

(a) the amino acid sequence of SEQ ID NO:16; 

(b) the amino add sequence of SEQ ID NO:16 from amino add 1 to 
amino acid 58; 

(c) fragments ofthe amino add sequence of SEQ ID NO:16 comprising 
10 the amino add sequence from amino add 44 to amino add 53 of SEQ ID NOIS; 

and 

(d) the amino add sequence encoded by the cDNA insert of clone 
fr473_2 deposited under accession number ATCC 98415; 

the protein being substantially free from other mammalian proteins. Preferably such 
1 5 protein comprises the amino add sequence of SEQ ID NO:16 or the amino add sequence 

of SEQ ID NO:16 from amino acid 1 to amino add 58. 

In certain preferred embodiments, the polynudeotide is operably linked to an 

expression control sequence. The invention also provides a host ceU, induding bacterial, 

yeast, insect and manmialian cells, transformed with such polynucleotide compositions. 
2 0 Also provided by the present invention are organisms that have enhanced, reduced, or 

modified expression of the gene(s) corresponding to the polynucleotide sequences 

disclosed herein. 

Processes are also provided for producing a protein, which comprise: 

(a) growing a culture of the host cell transformed with such 
2 5 polynucleotide compositions in a suitable culture meditim; and 

(b) purifying the protein from the culture. 

The protein produced according to such methods is also provided by the present 
invention. Preferred embodiments indude those in which the protein produced by such 
process is a mature form of the protein. 
30 Protein compositions of the present invention may further comprise a 

pharmaceutically acceptable carrier. Compositions comprising an antibody which 
specifically reacts with such protein are also provided by the present invention. 

Methods are also provided for preventing, treating or ameliorating a medical 
condition which comprises administering to a mammalian subject a therapeutically 
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effective amount of a composition comprising a protein of the present invention and a 
pharmaceutically acceptable carrier. 

BRIEF DESCRIPTION OF THE DRAWINGS 
5 Figures lA and IB are schematic representations of the pED6 and pNOTs vectors, 

respectively, used for deposit of clones disclosed herein. 

DETAILED DESCRIPTION 
ISOLATED PROTEINS AND POLYNUCLEOTIDES 

10 Nucleotide and amino acid sequences, as presently determined, are reported 

below for each clone and protein disclosed in the present application. The nucleotide 
sequence of each done can readily be determined by sequencing of the deposited clone 
in accordance with known methods. The predicted amino add sequence (both full-length 
and mature forms) can then be determined from such nucleotide sequence. The amino 

15 acid sequence of the protein encoded by a particular done can also be determined by 
expression of the done in a suitable host cell, collecting the protein and determining its 
sequence. For each disclosed protein applicants have identified what they have 
determined to be the reading frame best identifiable with sequence information available 
at the time of filing. 

20 As used herein a "secreted" protein is one which, when expressed in a suitable host 

cell, is transported across or through a membrane, induding transport as a result of signal 
sequences in its amino add sequence. "Secreted" proteins indude without limitation 
proteins secreted wholly (e.g., soluble proteins) or partially (e.g. , receptors) from the cell 
in which they are expressed. "Secreted" proteins also indude without limitation proteins 

2 5 which are transported across the membrane of the endoplasmic reticulum. 

Clone "d25 4" 

A polynucleotide of the present invention has been identified as clone "ci25_4". 
ci25_4 was isolated from a human adult brain cDNA library using methods which are 

3 0 selective for cDN As encoding secreted proteins (see U.S. Pat. No. 5336,637), or was 

identified as encoding a secreted or transmembrane protein on the basis of computer 
analysis of the amino add sequence of the encoded protein. ci25_4 is a full-length done, 
induding the entire coding sequence of a secreted protein (also referred to herein as 
*'ci25_4 protein"). 
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The nucleotide sequence of ci25_4 as presently determined is reported in SEQ ID 
NO:l. What applicants presently believe to be the proper reading frame and the predicted 
amino add sequence of the ci25_4 protein corresponding to the foregoing nucleotide 
sequence is reported in SEQ ID NO:2. Amino adds 9 to 21 are a predicted leader/signal 
5 sequence, with the predicted mature amino add sequence begiiming at amino add 22, or 
are a transmembrane domain. 

The EcoRI/NotI restriction fragment obtainable from the deposit containing done 
d25_4 should be approximately 1700 bp. 

The nudeotide sequence disdosed herein for ci25_4 was searched against the 
10 GenBank and GeneSeq nudeotide sequence databases using BLASTN/BLASTX and 
PASTA search protocols. ci25_4 demonstrated at least some similarity with sequences 
identified as AA243050 {2r24h03.rl Stratagene NT2 neiironal precursor 937230 Homo 
sapiens cDNA done 664373 5*), AA316800 (EST188485 HCC cell line (matastasis to liver 
in mouse) 11 Homo sapiens cDNA 5* end), AA340783 {EST46083 Fetal kidney U Homo 
1 5 sapiens cDNA 5* end), Q05686 (Islets of Langerhans cell done ICA123 (ATCC 40703)), 
R12690 (yf40e07.sl Homo sapiens cDNA done 129348 3'), R16432 (yf40e07.rl Homo 
sapiens cDNA done), W81653 (zd84dl2.rl Soares fetal heart NbHH19W Homo sapiens 
cDNA done 347351 5"), and W81654 (zd84dl2.sl Soares fetal heart NbHH19W Homo 
sapiens cDNA done 347351 3'). Based upon sequence similarity, ci25_4 proteins and each 

2 0 similar protein or peptide may share at least some activity. The TopPredll computer 

program predicts five additional potential transmembrane domains within the d25_4 
protein sequence, centered around amino adds 81, 134, 159, 182, and 241 of SEQ ID NO:2, 
respectively. 

25 Clone "da228 6" 

A pol)mudeotide of the present invention has been identified as clone "da228_6". 
da228_6 was isolated from a human adult placenta cDNA library using methods which 
are selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or was 
identified as encoding a secreted or transmembrane protein on the basis of computer 

3 0 analysis of the amino add sequence of the encoded protein. da228_6 is a full-length done, 

including the entire coding sequence of a secreted protein (also referred to herein as 
"dia228_6 protein"). 

The nudeotide sequence of da228_6 as presently determined is reported in SEQ 
ID NO:3. What applicants presentiy believe to be the proper reading frame and the 
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predicted amino acid sequence of the da228_6 protein corresponding to the foregoing 
nucleotide sequence is reported in SEQ ID NO:4. 

The EcoRI/NotI restriction fragment obtainable from the deposit contaiiung clone 
da228_6 should be approximately 1500 bp. 
5 The nucleotide sequence disclosed herein for da228_6 was searched against the 

GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 
FASTA search protocols. da228_6 demonstrated at least some similarity with sequences 
identified as W57906 {zdl7fll.rl Soares fetal heart NbHH19W Homo sapiens cDNA done 
340941 5') and W57907 (zdl7fll.sl Soares fetal heart NbHH19W Homo sapiens cDNA 
10 clone 340941 3'. Based upon sequence similarity, da228_6 proteins and each similar 
protein or peptide may share at least some activity. 

aone"du410 5" 

A |K>lynudeotide of the present invention has been identified as clone "du410_5". 

1 5 du410_5 was isolated from a human fetal brain cDNA library using methods which are 
selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5^36,637), or was 
identified as encoding a secreted or transmembrane protein on the basis of computer 
analysis of the amino add sequence of the encoded protein, du410_5 is a full-length done, 
including the entire coding sequence of a secreted protein (also referred to herein as 

2 0 "du410_5 protein"). 

The nudeotide sequence of du410_5 as presently determined is reported in SEQ 
ID NO:5. What applicants presently believe to be the proper reading frame and the 
predicted amino add sequence of the du410_5 protein corresponding to the foregoing 
nudeotide sequence is reported in SEQ ID NO:6. 

2 5 The EcoRI/NotI restriction fragment obtainable from the deposit containing done 

du410_5 should be approximately 2400 bp. 

The nudeotide sequence disdosed herein for du410_5 was searched against the 
GenBank and GeneSeq nudeotide sequence databases using BLASTN/BLASTX and 
FASTA search protocols. du410_5 demonstrated at least some similarity with sequences 

3 0 identified as N44315 (EST51pl9 WATMl Homo sapiens cDNA done 51pl9) and N66980 

(yz58d04.sl Homo sapiens cDNA clone 287239 3'). The predicted amino add sequence 
disdosed herein for du410_5 was searched against the GenPept and GeneSeq amino add 
sequence databases using the BLASTX search protocol. The predicted du410_5 protein 
demonstrated at least some similarity to sequences identified as U67604 (PI 15 protein 
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[MethanoccKCUS jannaschii]). Based upon sequence similarity, du410_5 proteins and each 
similar protein or peptide may share at least some activity. 

Qone^ehSO 1" 

5 A polynucleotide of the present invention has been identified as clone "eh80_l". 

eh80_l was isolated firom a human adult blood (peripheral blood mononuclear cells 
treated with granulocyte-colony stimulating factor in vivo) cDNA library using methods 
which are selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or 
was identified as encoding a secreted or transmembrane protein on the basis of computer 
1 0 analysis of the amino add sequence of the encoded protein. eh80_l is a full-length clone, 
including the entire coding sequence of a secreted protein (also referred to herein as 
"eh80_l protein"). 

The nucleotide sequence of eh80_l as presently determined is reported in SEQ ID 
NO:7. What applicants presently believe to be the proper reading frame and the predicted 
15 amino acid sequence of the eh80_l protein corresponding to the foregoing nucleotide 
sequence is reported in SEQ ID NO:8. Another potential eh80_l reading frame and 
predicted amino add sequence is encoded by basepairs 41 to 1659 of SEQ ID NO:7 and 
is reported in SEQ ID NO:25. A frameshift in the nudeotide sequence of SEQ ID NO:5 
between about nudeotide 41 to about nucleotide 614 could join together portions of the 
2 0 overlapping reading frames of SEQ ID NO:8 and SEQ ID NO:25. 

The EcoRI/NotI restriction firagment obtainable from the deposit containing done 
eh80_l should be approximately 2000 bp. 

The nudeotide sequence disdosed herein for eh80_l was searched against the 
GenBank and GeneSeq nudeotide sequence databases using BLASTN/BLASTX and 

2 5 FASTA search protocols. eh80„l demonstrated at least some similarity with sequences 

identified as AA012957 (ze27b03.rl Soares retina N2b4HR Homo sapiens cDNA done 
360173 5'), AA019878 (ze63b03.sl Soares retina N2b4HR Homo sapiens cDNA done 
363629 3'), AA505456 (nh84c07.sl Na_CGAP_Brl.l Homo sapiens cDNA done IMAGE 
965196), Q60246 (Human brain Expressed Sequence Tag EST02242), R16603 (yf43c04.rl 

3 0 Homo sapiens cDNA done 129606 5'), and T85469 (yd82f05.rl Homo sapiens cDNA done 

114753 5'). The predicted amino add sequence disdosed herein for eh80_l was searched 
against the GenPept and GeneSeq amino add sequence databases using the BLASTX 
search protocol. The predicted eh80_l protein demonstrated at least some similarity to 
sequences identified as U40747 (FBP 11 [Mus musculus]). Based upon sequence 
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similarity, eh80_l proteins and each similar protein or peptide may share at least some 
activity. The TopPredll computer program predicts two potential transmembrane 
domains within the amino add sequence of SEQ ID NO:8, one centered aroimd amino 
acid 107 and another aroimd amino add 131. 

5 

Clone "er369 1" 

A polynudeotide of the present invention has been identified as done "er369_l". 
er369_l was isolated from a human fetal brain cDNA library using metiiods which are 
selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5^36,637), or was 
1 0 identified as encoding a secreted or transmembrane protein on the basis of computer 
analysis of the amino add sequence of the encoded protein. er369_l is a full-length done, 
including the entire coding sequence of a secreted protein (also referred to herein as 
"er369_l protein"). 

The nudeotide sequence of er369_l as presently determined is reported in SEQ ID 
1 5 NO:9. What applicants presently believe to be the proper reading frame and the predicted 
amino add sequence of the er369_l protein corresponding to the foregoing nudeotide 
sequence is reported in SEQ ID NOrlO. Amino adds 17 to 29 are a predicted leader/signal 
sequence, with the predicted mature amino add sequence beginning at amino add 30, or 
are a transmembrane domain. 

2 0 The EcoRI/NotI restriction fragment obtainable from the deposit containing done 

er369_l should be approximately 1500 bp. 

The nudeotide sequence disclosed herein for er369_l was searched against the 
GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 
FASTA search protocols. er369_l demonstrated at least some similarity with sequences 
25 identified as H12227 (yml2gl0.rl Homo sapiens cDNA done 47729 5'), H70978 
(yr73g06.rl Homo sapiens cDNA done 210970 5'), M79179 (EST01327 Homo sapiens 
cDNA done HHCP081), Q61324 (Human brain Expressed Sequence Tag EST01327), and 
R53554 (yg84e04.sl Homo sapiens cDNA done 39854 3' similar to contains Alu repetitive 
element). Based upon sequence similarity, er369_l proteins and each similar protein or 

3 0 peptide may share at least some activity. The nudeotide sequence of er369_l indicates 

that it may contain ah Alu repetitive element. 
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gone "fhl23 5" 

A polynucleotide of the present invention has been identified as clone "fhl23_5". 
fhl23_5 was isolated from a human fetal brain cDNA library using methods which are 
selective for cDNAs encoding secreted proteins (see U.S. Pat No. 5^36,637), or was 
5 identified as encoding a secreted or transmembrane protein on the basis of computer 
analysis of the amino add sequence of the encoded protein. fhl23_5 is a full-length clone, 
including the entire coding sequence of a secreted protein (also referred to herein as 
"fhl23_5 protein"). 

The nucleotide sequence of fhl23_5 as presently determined is reported in SEQ ID 
10 NO:ll. What applicants presently believe to be the proper reading frame and the 
predicted amino add sequence of the fhl23_5 protein corresponding to the foregoing 
nudeotide sequence is reported in SEQ ID Nai2. Amino adds 694 to 706 are a predicted 
leader/signal sequence, with the predicted mature amino add sequence beginning at 
amino add 707, or are a transmembrane domain. 
1 5 The EcoRI/NotI restriction fragment obtainable from the deposit containing done 

fhl23_5 should be approximately 2800 bp. 

The nucleotide sequence disclosed herein for fhl23_5 was searched against the 
GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 
FASTA search protocols. fhl23_5 demonstrated at least some similarity with sequences 
2 0 identified as AA815253 (ai64d02.sl Soares testis NHT Homo sapiens cDNA done 1375587 
3'), AA855689 (vw71h04.rl Stratagene mouse heart (#937316) Mus musculus cDNA clone 
1260439 5'), and W80785 (zd83d07.sl Soares fetal heart NbHH19W Homo sapiens cDNA 
clone 347245 3). The predicted amino add sequence disdosed herein for fhl23_5 was 
searched against the GenPept and GeneSeq amino add sequence databases using the 
25 BLASTX search protocol. The predicted fhl23_5 protein demonstrated at least some 
similarity to sequences identified as D80005 (KIAA0183 [Homo sapiens]). Based upon 
sequence similarity, fhl23_5 proteins and each similar protein or peptide may share at 
least some activity. The TopPredll computer program predicts five additional possible 
transmembrane domains within the fhl23_5 protein sequence. 

30 

Clone "fm60 1" 

A polynudeotide of the present invention has been identified as done "fm60_l". 
fm60_l was isolated from a human adult brain cDNA library using methods which are 
selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or was 
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identified as encoding a secreted or transmembrane protein on the basis of computer 
analysis of the amino add sequence of the encoded protein. ftn60_l is a full-length done, 
including the entire coding sequence of a secreted protein (also referred to herein as 
"£m60_l protein"). 

5 The nucleotide sequence of fm60_l as presently determined is reported in SEQ ID 

NO:13. What applicants presently believe to be the proper reading frame and the 
predicted amino add sequence of the fm60_l protein corresponding to the foregoing 
nudeotide sequence is reported in SEQ ID NO;14, 

The EcoRI/NotI restriction fragment obtainable from the deposit containing done 

1 0 fm60_l should be approximately 2200 bp. 

The nudeotide sequence disdosed herein for fm60_l was searched against the 
GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 
FASTA search protocols. fm60_l demonstrated at least some similarity with sequences 
identified as AA155574 (zo70a01.sl Stratagene pancreas (#937208) Homo sapiens cDNA 

15 done 592200 3'), AF015147 (Homo sapiens done HS19.1 Alu-Ya5 sequence), N86095 
(J6377F Fetal heart. Lambda ZAP Express Homo sapiens cDNA clone J6377 5' similar to 
REPETmVE ELEMENT ALU), U14567 (***ALU WARNING Human Alu-J subfamily 
consensus sequence), and Z82199 (Human DNA sequence from done J316D5). Based 
upon sequence similarity, fm60_l proteins and each similar protein or peptide may share 

20 at least some activity. The TopPredll computer program predicts a potential 
transmembrane domeiin within the fm60_l protein sequence centered around amino add 
50 of SEQ ID NO:14. The nudeotide sequence of fm60_l indicates that it may contain one 
or more of the following repetitive elements: Alu, LI. 

25 Gone "fr473 2" 

A polynucleotide of the present invention has been identified as clone "fr473_2". 
fr473_2 was isolated from a human adult placenta cDNA library using methods which are 
selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or was 
identified as encoding a secreted or transmembrane protein on the basis of computer 

3 0 analysis of the amino add sequence of the encoded protein. fr473_2 is a full-length clone, 
induding the entire coding sequence of a secreted protein (also referred to herein as 
"fr473_2 protein"). 

The nudeotide sequence of fr473_2 as presently determined is reported in SEQ ID 
NO: 15. What applicants presently believe to be the proper reading frame and the 
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predicted amino add sequence of the fr473_2 protein corresponding to the foregoing 
nucleotide sequence is reported in SEQ ID NO:16. Amino acids 25 to 37 are a predicted 
leader/signal sequence, with the predicted mature amino add sequence t>eginning at 
amino add 38, or are a transmembrane domain. Amino adds 62 to 74 are another possible 
5 leader/ signal sequence, with the predicted mature amino add sequence beginning at 
amino add 75, or are a transmembrane domain. 

The EcoRI/NotI restriction fragment obtainable from the deposit containing done 
fr473_2 should be approximately 605 bp. 

The nudeotide sequence disdosed herein for fr473_2 was searched against the 

10 GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 
FASTA search protocols. fr473_2 demonstrated at least some similarity with sequences 
identified as AA479559 (zu42a02.rl Scares ovary tumor NbHOT Homo sapiens cDNA 
clone 740618 5' similar to WP:F49C12.12 CE03372), H46855 {yol8g04.rl Homo sapiens 
cDNA done 178326 5'), T24372 (Human gene signature HUMGS06404), W31692 

1 5 (zb93d01.rl Soares parathyroid tumor NbHPA Homo sapiens cDNA done 320353 5'), and 
Z32877 (H. sapiens partial cDNA sequence; done HEA41P; single read). The predicted 
amino add sequence disdosed herein for fr473_2 was searched against the GenPept and 
GeneSeq amino add sequence databases using the BLASTX search protocol. The 
predided fr473_2 protein demonstrated at least some similarity to sequences identified 

20 as Z68227 (F49C12.12 [Caenorhabditis elegans)). Based upon sequence similarity, fr473_2 
proteins and each similar protein or peptide may share at least some activity. 

Eteposit of Clones 

Qones ci25_4, da228_6, du410_5, eh80_l, er369_l, fhl23_5, fm60_l, and fr473_2 
25 were deposited on April 25, 1997 with the American Type Culture Collection (10801 
University Boulevard, Manassas, Virginia 20110-2209 U.S.A.) as an original deposit under 
the Budapest Treaty and were given the accession number ATCC 98415, from which each 
done comprising a particular polynudeotide is obtainable. All restrictions on the 
availability to the public of the deposited material will be irrevocably removed upon the 
3 0 granting of the patent, except for the requirements spedfied in 37 C.F.R. § 1.808(b), and 
the term of the deposit will comply with 37 C.F.R. § 1 .806. 

Each clone has been transfected into separate bacterial cells (£. coli) in this 
composite deposit Each done can be removed from the vector in which it was deposited 
by performing an EcoRI/NotI digestion (5' site, EcoRI; 3' site, NotI) to produce the 
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appropriate fragment for such done. Each done was deposited in either the pED6 or 
pNOTs vector depicted in Figures lA and IB, respectively. The pED6dpc2 vector 
("pED6") was derived from pED6dpcl by insertion of a new polylinker to facilitate 
cDNA doning (Kaufman et ah, 1991, Nucleic Adds Res. 19: 44854490); the pNOTs vector 
5 was derived from pMT2 (Kaufman et al, 198% MoL Cell. Biol 9: 946-958) by deletion of 
the DHFR sequences, insertion of a new polylinker, and insertion of the M13 origin of 
replication in the Clal site. In some instances, the deposited done can become "flipped" 
(i.e., in the reverse orientation) in the deposited isolate. In such instances, the cDNA insert 
can still be isolated by digestion with EcoRI and NotL However, NotI will then produce 
10 the 5* site and EcoRI will produce the 3' site for placement of the cDNA in proper 
orientation for expression in a suitable vector. The cDNA may also be expressed from the 
vectors in which they were deposited. 

Bacterial cells containing a particular clone can be obtained from the composite 
1 5 deposit as follows: 

An oligonudeotide probe or probes should be designed to the sequence that is 
known for that particular clone. This sequence can be derived from the sequences 
provided herein, or from a combination of those sequences. The sequence of an 
oligonudeotide probe that was used to isolate or to sequence each full-length done is 
2 0 identified below, and should be most reliable in isolating the clone of interest 



Clone 


Probe Sequence 


ci25_4 


SEQ ID NO:17 


da228_6 


SEQIDNO:18 


25 du410_5 


SEQIDNO:19 


eh80_l 


SEQIDNO:20 


er369_l 


SEQIDNO:21 


fhl23_5 


SEQIDNO:22 


hn60_l 


SEQIDNO:23 


30 fr473_2 


SEQIDNO:24 



In the sequences listed above which indude an N at position 2, that position is occupied 
in preferred probes/primers by a biotinylated phosphoaramidite residue rather than a 
nudeotide (such as , for example, that produced by use of biotin phosphoramidite (1- 
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dimethoxytrityioxy-2-(N-biotinyl-4-aminobutyl)-propyl-3-0-(2K:yanoethyl)-{N,N^ 
diisopropyl)-phosphoramadite) (Glen Research, cat. no. 10-1953)). 

The design of the oligonucleotide probe should preferably follow these 
parameters: 

5 (a) It should be designed to an area of the sequence which has the fewest 

ambiguous bases ("NTs"), if any; 
(b) It shotild be designed to have a T„ of approx. 80 ° C (assuming 2° for each 
A or T and 4 degrees for each G or C). 
The oligonucleotide should preferably be labeled with g-^P ATP (specific activity 6000 
10 Ci/mmole) and T4 polynucleotide kinase using commonly employed techniques for 
labeling oligonucleotides. Other labeling techniques can also be used. Unincorporated 
label should preferably be removed by gel filtration chromatography or other established 
methods. The amount of radioactivity incorporated into the probe should be quantitated 
by measurement in a scintillation counter. Preferably, specific activity of the resulting 
1 5 probe should be approximately 4e+6 dpm/pmole. 

The bacterial culture containing the pool of full-length clones should preferably 
be thawed and 100 pi of the stock used to inoculate a sterile culture flask containing 25 ml 
of sterile L-broth containing ampidllin at 100 pg/ml. The culture should preferably be 
grown to saturation at 37^C, and the saturated culture should preferably be diluted in 

2 0 fresh L-broth. Aliquots of these dilutions should preferably be plated to determine the 

dilution and volume which will yield approximately 5000 distinct and well-separated 
colonies on solid bacteriological media containing L-broth containing ampidllin at 100 
pg/ml and agar at 1.5% in a 150 mm petri dish when grown overnight at 37°C. Other 
known methods of obtaining distinct, well-separated colonies can also be employed. 
25 Standard colony hybridization procedures should then be used to transfer the 

colonies to nitrocellulose filters and lyse, denature and bake them. 

The filter is then preferably incubated at 65°C for 1 hour with gentle agitation in 
6X SSC (20X stock is 175.3 g NaCl/liter, 88.2 g Na dtrate/liter, adjusted to pH 7.0 with 
NaOH) containing 0.5% SDS, 100 pg/ml of yeast RNA, and 10 mM EDTA (approximately 

3 0 10 mL per 150 mm filter). Preferably, the probe is then added to tiie hybridization mix at 

a concentration greater than or equal to le+6 dpm/mL. The filter is then preferably 
incubated at 65°C with gentle agitation overnight. The filter is then preferably washed in 
500 mL of 2X SSC/05% SDS at room temperature without agitation, preferably followed 
by 500 mL of 2X SSC/0.1% SDS at room temperature with gentle shaking for 15 minutes. 
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A third wash with 0,1X SSC/0;5% SDS at 65^C for 30 minutes to 1 hour is optional. The 
filter is then preferably dried and subjected to autoradiography for sufficient time to 
visualize the positives on the X-ray film. Other known hybridization methods can also 
be employed. 

5 The positive colonies are picked, grown in culture, and plasmid DNA isolated 

using standard procedures. The clones can then be verified by restriction analysis, 
hybridization analysis, or DNA sequencing. 

Fragments of the proteins of the present invention which are capable of exhibiting 
biological activity are also encompassed by the present invention. Fragments of the 

1 0 protein may be in linear form or they may be cyclized using known methods, for example, 
as described in H.U. Saragovi, et al, Bio/Technology 10, 773-778 (1992) and in R.S. 
McDowell, et a/., J. Amer. Chem. Soc. 114, 9245-9253 (1992), both of which are incorporated 
herein by reference. Such fragments may be fused to carrier molecules such as 
immimoglobulins for many purposes, including increasing the valency of protein binding 

1 5 sites. For example, fragments of the protein may be fused through "linker" sequences to 
the Fc portion of an immunoglobulin. For a bivalent form of the protein, such a fusion 
could be to the Fc portion of an IgG molecule. Other immunoglobulin isotypes may also 
be used to generate such fusions. For example, a protein - IgM fusion would generate a 
decavalent form of the protein of the invention, 

20 The present invention also provides both full-length and mature forms of the 

disclosed proteins. The full-length form of the such proteins is identified in the sequence 
listing by translation of the nucleotide sequence of each disclosed clone. The mature 
form(s) of such protein may be obtained by expression of the disclosed full-length 
polynucleotide (preferably those deposited with ATCC) in a suitable mammalian cell or 

25 other host cell. The sequence(s) of the mature form(s) of the protein may also be 
determinable from the amino acid sequence of the full-length form. 

The present invention also provides genes corresponding to the polynucleotide 
sequences disclosed herein. "Corresponding genes" are the regions of the genome that 
are transcribed to produce the mRNAs from which cDNA polynucleotide sequences are 

3 0 derived and may include contiguous regions of the genome necessary for the regulated 
expression of such genes. Corresponding genes may therefore include but are not limited 
to coding sequences, 5' and 3' imtranslated regions, alternatively spliced exons, introns, 
promoters, enhancers, and silencer or suppressor elements. The corresponding genes can 
be isolated in accordance with known methods using the sequence information disclosed 
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herein. Such methods include the preparation of probes or primers from the disclosed 
sequence information for identification and /or amplification of genes in appropriate 
genomic libraries or other sources of genomic materials. An "isolated gene" is a gene that 
has been separated from the adjacent coding sequences, if any, present in the genome of 
5 the organism from which the gene was isolated. 

Organisms that have enhanced, reduced, or modified expression of the gene(s) 
corresponding to the polynucleotide sequences disclosed herein are provided. The 
desired change in gene expression can be achieved through the use of antisense 
polynucleotides or ribozymes that bind and/or cleave the mRNA transcribed from the 

1 0 gene (Albert and Morris, 1994, Trends Pharmacol Sci. 15(7): 250-254; Lavarosky et al, 1997, 
Biochem. Mol Med, 62(1): 11-22; and Hampel, 1998, Prog. Nucleic Acid Res. Mol Biol 58: 1- 
39; aU of which are incorporated by reference herein). Transgenic animals that have 
multiple copies of the gene(s) corresponding to the poljnnucleotide sequences disclosed 
herein, preferably produced by transformation of cells with genetic constructs that are 

15 stably maintained within the transformed cells and their progeny, are provided. 
Transgenic animals that have modified genetic control regions that increase or reduce 
gene expression levels, or that change temporal or spatial patterns of gene expression, are 
also provided (see European Patent No. 0 649 464 Bl, incorporated by reference herein). 
In addition, organisms are provided in which the gene(s) corresponding to the 

2 0 polynucleotide sequences disclosed herein have been partially or completely inactivated, 
through insertion of extraneous sequences into the corresponding gene(s) or through 
deletion of all or part of the corresponding gene(s). Partial or complete gene inactivation 
can be accomplished through insertion, preferably foUowed by imprecise excision, of 
transposable elements (Plasterk, 1992, Bioessays 14(9): 629-633; Zwaal et al, 1993, Proc, Natl 

2 5 Acad. Sci. USA 90(16): 7431-7435; Clark et <z/., 1994, Proc. Natl Acad. Sci. USA 91(2): 719-722; 

all of which are incorporated by reference herein), or through homologous recombination, 
preferably detected by positive/negative genetic selection strategies (Mansour et al, 1988, 
Nature 336: 348-352; VS. Patent Nos. 5,464,764; 5,487,992; 5,627,059; 5,631,153; 5,614, 396; 
5,616,491; and 5,679,523; all of which are incorporated by reference herein). These 

3 0 organisms with altered gene expression are preferably eukaryotes and more preferably 

are mammals. Such organisms cire useful for the development of non-htunan models for 
the study of disorders involving the corresponding gene(s), and for the development of 
assay systems for the identification of molecules that interact with the protein product(s) 
of the corresponding gene(s). 
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Where the protein of the present invention is membrane-bound (eg., is a receptor), 
the present invention also provides for soluble forms of such protein. In such forms part 
or ail of the intracellular and transmembrane domains of the protein are deleted such that 
the protein is fully secreted from the cell in which it is expressed. The intracellular and 
5 transmembrane domains of proteins of the invention can be identified in accordance with 
known techniques for determination of such domains from sequence information. 

Proteins and protein fragments of the present invention include proteins with 
amino add sequence lengths that are at least 25%(more preferably at least 50%, and most 
preferably at least 75%) of the length of a disclosed protein and have at least 60% sequence 

10 identity (more preferably, at least 75% identity; most preferably at least 90% or 95% 
identity) with that disclosed protein, where sequence identity is determined by comparing 
the amino add sequences of the proteins when aligned so as to maximize overlap and 
identity while minimizing sequence gaps. Also included in the present invention are 
proteins and protein fragments that contain a segment preferably comprising 8 or more 

15 (more preferably 20 or more, most preferably 30 or more) contiguous amino adds that 
shares at least 75% sequence identity (more preferably, at least 85% identity; most 
preferably at least 95% identity) with any such segment of any of the disdosed proteins. 

Spedes homologues of the disdosed polynudeotides and proteins are also 
provided by the present invention. As used herein, a "species homologue" is a protein or 

20 polynucleotide with a different species of origin from that of a given protein or 
polynucleotide, but with significant sequence similarity to the given protein or 
polynucleotide. Preferably, polynucleotide species homologues have at least 60% sequence 
identity (more preferably, at least 75% identity; most preferably at least 90% identity) with 
the given polynudeotide, and protein species homologues have at least 30% sequence 

2 5 identity (more preferably, at least 45% identity; most preferably at least 60% identity) with 

the given protein, where sequence identity is determined by comparing the nudeotide 
sequences of the polynucleotides or the amino add sequences of the proteins when 
aligned so as to maximize overlap and identity while minimizing sequence gaps. Spedes 
homologues may be isolated and identified by making suitable probes or primers from 

3 0 the sequences provided herein and screening a suitable nudeic add source from the 

desired spedes. Preferably, spedes homologues are those isolated from mammalian 
spedes. Most preferably, spedes homologues are those isolated from certain mammalian 
spedes such as, for example. Pan troglodytes. Gorilla gorilla, Pongo pygmaeus, Hylobates 
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coricolor, Macaca mulatta, Papio papio, Papio hamadryas, Cercopithecus aethiops, Cebus capucinus, 
Aottis trwirgatus, Sanguinus oedipus, Microcebus murinus, Mus musculiis, Rattus norvegicus, 
Cricetulus griseus. Felts catus, Mustek vison. Cants familiaris, Oryctolagtts cuniculiis. Bos taurtis, 
Ovis aries, Sus scrofa, and Equus caballus, for which genetic maps have been created 
5 allowing the identification of syntenic relationships behveen the genomic organization of 
genes in one species and the genomic organization of the related genes in another species 
(O'Brien and Seuanez, 1988, Ann. Rev. Genet. 22: 323-351; O'Brien et al, 1993, Nature 
Genetics 3:103-112; Johansson et al, 1995, Genomics 25: 682-690; Lyons et a/., 1997, Nature 
Genetics 15: 47-56; O'Brien et al, 1997, Trends in Genetics 13(10): 393-399; Carver and Stubbs, 
10 1997, Genome Research 7:1123-1137; all of which are incorporated by reference herein). 

The invention also encompasses allelic variants of the disclosed polynucleotides 
or proteins; that is, naturally-occurring alternative forms of the isolated polynucleotides 
which also encode proteins which are identical or have significantiy similar sequences to 
those encoded by the disclosed polynucleotides. Preferably, allelic variants have at least 
1 5 60% sequence identity (more preferably, at least 75% identity; most preferably at least 90% 
identity) with the given polynucleotide, where sequence identity is determined by 
comparing the nucleotide sequences of the polynucleotides when aligned so as to maximize 
overiap and identity while minimizing sequence gaps. ADelic variants may be isolated and 
identified by making suitable probes or primers from the sequences provided herein and 
2 0 screening a siutable nucleic acid source from individuals of the appropriate species. 

The invention also includes polynucleotides with sequences complementary to 
those of the polynucleotides disclosed herein. 

The present invention also includes polynucleotides tiiat hybridize under reduced 
stringency conditions, more preferably stringent conditions, and most preferably highly 
25 stringent conditions, to polynucleotides described herein. Examples of stringency 
conditions are shown in the table below: highly stringent conditions are those that are at 
least as stringent as, for example, conditions A-F; stringent conditions are at least as 
stringent as, for example, conditions G-L; and reduced stringency conditions are at least 
as stringent as, for example, conditions M-R. 
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Stringency 
Condition 


Polynucleotide 
Hybrid 


Hybrid 
Length 
(bp)^ 


Hybridization Temperature and 
Buffer* 


Wash 

and Buffer* 


A 


DNA:DNA 


a 50 


65''C; IxSSC -or- 

42''C; IxSSC, 50% formamide 


65 "C; 03xSSC 


B 


DNA:DNA 


<50 


Tb*; IxSSC 


V; IxSSC 


c 


DNAJ?NA 


i 50 


67**C; IxSSC -or- 

45°C; IxSSC, 50% formamide 


67**r- 0 ^Y^if^c 


D 


DNArRNA 


<50 


V; IxSSC 


Td»; IxSSC 


E 


RNA:RNA 


i 50 


70**C; IxSSC -or- 

SO^C; IxSSC, 50% formamide 




F 


RNA:RNA 


<50 


V; IxSSC 


Tf*; IxSSC 


G 


DNA:DNA 


2 50 


65 °C' 4xSSC -or- 

42*C; 4xSSC, 50% formamide 




H 


DNA:DNA 


c50 


Th*;4xSSC 


V;4xSSC 


I 




i 50 


45°C; 4xSSC, 50% formamide 


(JJOC- 1 vCC/^ 
0/ IXOO^ 


J 


DNA:RNA 


<50 


T*;4xSSC 


Tj*;4xSSC 


K 


RNA-RNA 
jvi N • rvi ^ 


i. 50 


50°C; 4xSSC, 50% formamide 


o/ \_, xx:»L, 


L 


RNAiRNA 


<50 


V;2xSSC 


V;2xSSC 


M 




> 50 


40°C; 6xSSC, 50% formamide 




N 


DNA:DNA 


<50 


V;6xSSC 


V;6xSSC 


o 




i 50 


SST- 4vQ^C -or- 

42**C; 6xSSC, 50% formamide 




P 


DNArRNA 


<50 


Tp*;6xSSC 


Tp*;6xSSC 


Q 


RNA:RNA 


250 


60°C;4xSSC-or- 

45°C; 6xSSC, 50% formamide 


60"C;2xSSC 


R 


RNA:RNA 


<50 


V;4xSSC 


V;4xSSC 



h The hybrid length is that anticipated for the h)^ridized region(s) of the hybridizing polynucleotides. When 
hybridizing a polynucleotide to a target polynucleotide of unknown sequence, the hybrid length is assumed 
to be that of the hybridizing polynucleotide. When polynucleotides of known sequence are hybridized, the 

2 5 hyfbrid length can be determined by aligning the sequences of the polynucleotides and identifying the region 

or regions of optimal sequence complementarity. 

*: SSPE (IxSSPE is 0.15M NaCl, lOmM NaHjFO^, and 1.25mM EDTA, pH 7.4) can be substituted for SSC 
(IxSSC is 0.15M NaCl and 15mM sodium citrate) in the hybridization and wash buffers; washes are 
performed for 15 minutes after hybridization is complete. 
30 *Tb - T^: The hybridization temperature for hybrids anticipated to be less than 50 base pairs in length should 
be 5-10*C less than the melting temperature (T^ of the hybrid, where T„ is determined according to the 
following equations. For hybrids less than 18 base pairs in length, T^C'C) = 2(# of A + T bases) + 4(# of G + 
C bases). For hybrids between 18 and 49 base pairs in length, T„(**Q ~ 815 + 16.6(log,o[Na*]) + 0.41(%G+C) - 
(600/N), where N is the number of bases in the hybrid, and [Na*] is the concentration of sodium ions in the 

3 5 hybridization buffer ([Na*] for IxSSC = 0.165 M). 
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Additional examples of stringency conditions for polynucleotide hybridization are 
provided in Sambrook, J., E.F. Fritsch, and T. Maniatis, 1989, Molecular Cloning: A 
laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 
chapters 9 and 11, and Current Protocols in Molecular Biology, 1995, F^. Ausubel et al., eds., 
5 John Wiley & Sons, Inc., sections 2.10 and 6.3-6.4, incorporated herein by reference. 

Preferably, each such hybridizing polynucleotide has a length that is at least 
25%(more preferably at least 50%, and most preferably at least 75%) of the length of the 
polynucleotide of the present invention to which it hybridizes, and has at least 60% 
sequence identity (more preferably, at least 75% identity; most preferably at least 90% or 
10 95% identity) with the polynucleotide of the present invention to which it hybridizes, 
where sequence identity is determined by comparing the sequences of the hybridizing 
polynucleotides when aligned so as to maximize overlap and identity while minimizing 
sequence gaps. 

The isolated polynucleotide of the invention may be operably linked to an 
15 expression control sequence such as the pMT2 or pED expression vectors disclosed in 
Kaufman et al. Nucleic Adds Res. 19, 448S4490 (1991), in order to produce the protein 
recombinantiy. Mcmy suitable expression control sequences are known in the art. General 
methods of expressing recombinant proteins are also known and are exemplified in R. 
Kaufman, Methods in Enzymology 185, 537-566 (1990). As defined herein "operably 
2 0 linked" means that the isolated polynucleotide of the invention and an expression control 
sequence are situated within a vector or cell in such a way that the protein is expressed 
by a host cell which has been transformed (transfected) with the ligated 
polynucleotide/expression control sequence. 

A number of types of cells may act as suitable host cells for expression of the 

2 5 protein. Mammalian host cells include, for example, monkey COS cells, Chinese Hamster 

Ovary (CHO) cells, human kidney 293 cells, himian epidermal A431 cells, human Colo205 
cells, 3T3 cells, CV-1 cells, other transformed primate cell lines, normal diploid cells, cell 
strains derived from in vitro culture of primary tissue, primary explants, HeLa cells, 
mouse L cells, BHK, HL-60, U937, HaK or Jurkat cells. 

3 0 Alternatively, it may be possible to produce the protein in lower eukaryotes such 

as yeast or in prokaryotes such as bacteria. Potentially suitable yeast strains include 
Saccharomyces cerevisiae, Schizosaccharomyces pombe, Kluyveromyces strains, Candida, or any 
yeast strain capable of expressing heterologous proteins. Potentially suitable bacterial 
strains include Escherichia coli. Bacillus subtilis, Salmonella typhimurium, or any bacterial 
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strain capable of expressing heterologous proteins. If the protein is made in yeast or 
bacteria, it may be necessary to modify the protein produced therein, for example by 
phosphorylation or glycosylation of the appropriate sites, in order to obtain the functional 
protein. Such covalent attachments may be accomplished using known chemical or 
5 enzymatic methods. 

The protein may also be produced by operably linking the isolated polynucleotide 
of the invention to suitable control sequences in one or more insect expression vectors, 
and emplojring an insect expression system. Materials and methods for 
baculovirus/insect cell expression systems are commercially available in kit form from, 
1 0 e.g., Invitrogen, San Diego, California, U.S.A. (the MaxBac® kit), and such methods are 
well known in the art, as described in Summers and Smith, Texas Agricultural Experiment 
Station Bulletin No. 1555 (1987), incorporated herein by reference. As used herein, an 
insect cell capable of expressing a polynucleotide of the present invention is 
"transformed." 

1 5 The protein of the invention may be prepared by culturing transformed host cells 

under culture conditions suitable to express the recombinant protein. The resulting 
expressed protein may then be purified from such culture (i.e., from culture medium or 
cell extracts) using known purification processes, such as gel filtration and ion exchange 
chromatography. The purification of the protein may also include an affinity coltimn 

2 0 containing agents which will bind to the protein; one or more column steps over such 
affinity resins as concanavalin A-agarose, heparin-toyopearl® or Cibacrom blue 3GA 
Sepharose®; one or more steps involving hydrophobic interaction chromatography using 
such resins as phenyl ether, butyl ether, or propyl ether; or inununoaffinity 
chromatography. 

2 5 Alternatively, Ihe protein of the invention may also be expressed in a form which 

will facilitate purification. For example, it may be expressed as a fusion protein, such as 
those of maltose binding protein (MBP), glutathione-S-transferase (GST) or thioredoxin 
(TRX), Kits for expression and purification of such fusion proteins are commercially 
available from New England BioLab (Beverly, MA), Pharmacia (Piscataway, NJ) and 

30 InVitrogen, respectively. The protein can also be tagged with an epitope and 
subsequently purified by using a specific antibody directed to such epitope. One such 
epitope ("Hag") is commercially available from Kodak (New Haven, CT). 

Finally, one or more reverse-phase high performance liquid chromatography (RP- 
HPLC) steps employing hydrophobic RP-HPLC media, e.g., silica gel having pendant 
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methyl or other aliphatic groups, can be employed to further purify the protein. Some or 
all of the foregoing purification steps, in various combinations, can also be employed to 
provide a substantially homogeneous isolated recombinant protein. The protein thus 
purified is substantially free of other mammalian proteins and is defined in accordance 
5 with the present invention as an "isolated protein," 

The protein of the invention may also be expressed as a product of transgenic 
animals, e.g., as a component of the milk of transgenic cows, goats, pigs, or sheep which 
are characterized by somatic or germ cells containing a nucleotide sequence encoding the 
protein. 

10 The protein may also be produced by known conventional chemical synthesis. 

Methods for constructing the proteins of the present invention by synthetic means are 
known to those skilled in the art. The s)mthetically-constructed protein sequences, by 
virtue of sharing primary, secondary or tertiary structural and /or conformational 
characteristics with proteins may possess biological properties in common therewith, 

15 including protein activity. Thus, they may be employed as biologically active or 
immvmological substitutes for natural, purified proteins in screening of therapeutic 
compounds and in immimological processes for the development of antibodies. 

The proteins provided herein also include proteins characterized by amino add 
sequences similar to those of purified proteins but into which modification are naturally 

2 0 provided or deliberately engineered. For example, modifications in the peptide or DNA 

sequences can be made by those skilled in the art using known techniques. Modifications 
of interest in the protein sequences may include the alteration, substitution, replacement, 
insertion or deletion of a selected amino acid residue in the coding sequence. For 
example, one or more of the cysteine residues may be deleted or replaced with another 
25 amino add to alter the conformation of the molecule. Techniques for such alteration, 
substitution, replacement, insertion or deletion are well known to those skilled in the art 
(see, e.g., XJS, Patent No. 4,518,584). Preferably, such alteration, substitution, replacement, 
insertion or deletion retains the desired activity of the protein. 

Other fragments and derivatives of the sequences of proteins which would be 

3 0 expected to retain protein activity in whole or in part and may thus be useful for screening 

or other immvmological methodologies may also be easily made by those skilled in the art 
given the disdosures herein. Such modificatior\s are believed to be encompassed by the 
present invention. 
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USES AND BIOLOGICAL ACTIVITY 

The polynucleotides and proteins of the present invention are expected to exhibit 
one or more of the iises or biological activities (including those associated with assays 
cited herein) identified below. Uses or activities described for proteins of the present 
5 invention may be provided by administration or use of such proteins or by administration 
or use of polynucleotides encoding such proteins (such as, for example, in gene therapies 
or vectors suitable for introduction of DNA). 

Research Uses and Utilities 

1 0 The polynucleotides provided by the present invention can be used by the research 

community for various purposes. The polynucleotides can be used to express 
recombinant protein for analysis, characterization or therapeutic use; as markers for 
tissues in which the corresponding protein is preferentially expressed (either 
constitutively or at a particular stage of tissue differentiation or development or in disease 

15 states); as molecular weight markers on Southern gels; as chromosome markers or tags 
(when labeled) to identify chromosomes or to map related gene positions; to compare 
with endogenous DNA sequences in patients to identify potential genetic disorders; as 
probes to hybridize and thus discover novel, related DNA sequences; as a source of 
information to derive PCR primers for genetic fingerprinting; as a probe to "subtract-out" 

2 0 known sequences in the process of discovering other novel polynucleotides; for selecting 

and making oligomers for attachment to a "gene chip" or other support, including for 
examination of expression patterns; to raise anti-protein antibodies using DNA 
immimization techniques; and as an antigen to raise anti-DNA antibodies or elidt another 
immune response. Where the polynucleotide encodes a protein which binds or potenticdly 
25 binds to another protein (such as, for example, in a receptor-ligand interaction), the 
polynucleotide can also be used in interaction trap assays (such as, for example, those 
described in Gyuris et «/., 1993, Cell 75: 791-803 and in Rossi et ai, 1997, Proc. Natl. Acad. 
Sci. USA 94: 8405-8410, all of which are incorporated by reference herein) to identify 
polynucleotides encoding the other protein with which binding occurs or to identify 

3 0 inhibitors of the binding interaction. 

The proteins provided by the present invention can similarly be used in assay to 
determine biological activity, including in a panel of multiple proteins for high- 
throughput screerung; to raise antibodies or to elicit another immime response; as a 
reagent (including the labeled reagent) in assays designed to quantitatively determine 
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levels of the protein (or its receptor) in biological fluids; as markers for tissues in which 
the corresponding protein is preferentially expressed (either constitutively or at a 
particular stage of tissue differentiation or development or in a disease state); and, of 
course, to isolate correlative receptors or ligands. Where the protein binds or potentially 
5 binds to another protein (such as, for example, in a receptor-ligand interaction), the 
protein can be used to identify the other protein with which binding occurs or to identify 
inhibitors of the binding interaction. Proteins involved in these binding interactions can 
also be used to screen for peptide or small molecule inhibitors or agonists of the binding 
interaction. 

1 0 Any or all of these research utilities are capable of being developed into reagent 

grade or kit format for commercialization as research products. 

Methods for performing the uses listed above are well known to those skiUed in 

the art. References disclosing such methods include without limitation "Molecular 

Cloning: A Laboratory Manual", 2d ed.. Cold Spring Harbor Laboratory Press, Sambrook, 
15 J., E.F. Fritsch and T. Maniatis eds., 1989, and "Methods in Enzymology: Guide to 

Molecular Cloning Techniques", Academic Press, Berger, S.L. and A.R. Kimmel eds., 1987. 

Nutritional Uses 

Polynucleotides and proteins of the present invention can also be used as 
0 nutritional sources or supplements. Such uses include without limitation use as a protein 
or amino add supplement, use as a carbon source, use as a nitrogen source and use as a 
source of carbohydrate. In such cases the protein or polynucleotide of the invention can 
be added to the feed of a particular organism or can be administered as a separate solid 
or liquid preparation, such as in the form of powder, pills, solutions, suspensions or 
5 capsules. In the case of microorganisms, the protein or polynucleotide of the invention 
can be added to the meditun in or on which the microorganism is cultured. 

Cytokine and Cell Proliferation/EHfferentiation Activity 

A protein of the present invention may exhibit cytokine, cell proliferation (either 
0 inducing or inhibiting) or cell differentiation (either inducing or inhibiting) activity or may 
induce production of other cytokines in certain cell populations. Many protein factors 
discovered to date, including all knovm cytokines, have exhibited activity in one or more 
factor dependent cell proliferation assays, and hence the assays serve as a convenient 
confirmation of cjrtokine activity. The activity of a protein of the present invention is 
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evidenced by any one of a number of routine factor dependent cell proliferation assays 
for cell lines including, without limitation, 32D, DA2, DAIG, TIO, B9, B9/11, BaF3, 
MC9/G, M+ (preB M+), 2E8, RB5, DAI, 123, T1165, HT2, CTLU, TF-1, Mo7e and CMK. 

5 The activity of a protein of the invention may, among other means, be measured 

by the following methods: 

Assays for T-cell or thymocyte proliferation include without limitation those 

described in: Current Protocols in Immunology, Ed by J. E. Coligan, A.M. Kruisbeek, D.H. 

Margulies, E.M. Shevach, W Strober, Pub. Greene Publishing Associates and Wiley- 
1 0 Intersdence (Chapter 3, In Vitro assays for Mouse Lymphocyte Ftinction 3.1-3.19; Chapter 

7, Immunologic studies in Humans); Takai et aL, J. Immunol. 137:3494-3500, 1986; 

Bertagnolli et al., J. Immunol. 145:1706-1712, 1990; Bertagnolli et al.. Cellular Immunology 

133:327-341, 1991; Bertagnolli, et al., J. Immtmol. 149:3778-3783, 1992; Bowman et al., J. 

Immunol. 152: 1756-1761, 1994, 
15 Assays for cytokine production and / or proliferation of spleen cells, lymph node 

cells or thjmiocytes include, without limitation, those described in: Polyclonal T cell 

stimulation, Kruisbeek, A.M. and Shevach, E.M. In Current Protocols in Immunology. J.E.e.a. 

Coligan eds. Vol 1 pp. 3.12.1-3.12.14, John Wiley and Sons, Toronto. 1994; and 

Measurement of mouse and human Interferon y, Schreiber, R-D. In Current Protocols in 

2 0 Immunology. J.E.e.a. Coligan eds. Vol 1 pp. 6.8.1-6.8.8, John Wiley and Sons, Toronto. 1994. 

Assays for proliferation and differentiation of hematopoietic and lymphopoietic 
cells include, without limitation, those described in: Measurement of Human and Murine 
Interleukin 2 and Interleukin 4, Bottomly, K., Davis, L.S. and Lipsky, P.E. In Current 
Protocols in Immunology, J.E.e.a. Coligan eds. Vol 1 pp. 6.3.1-6.3.12, John Wiley and Sons, 
25 Toronto. 1991; deVries et al., J. Exp. Med. 173:1205-1211, 1991; Moreau et al.. Nature 
336:690-692, 1988; Greenberger et al., Proc. Natl. Acad. Sci. U.S.A. 80:2931-2938, 1983; 
Measurement of mouse and himian interleukin 6 - Nordan, R. In Current Protocols in 
Immunology. J.E.e.a. Coligan eds. Vol 1 pp. 6.6.1-6.65, John Wiley and Sons, Toronto. 1991; 
Smith et al., Proc. Natl. Acad. Sci. U.S.A. 83:1857-1861, 1986; Measurement of human 

3 0 Interleukin 11 - Bennett, P., Giannotti, J., Clark, S.C. and Turner, K. J. In Current Protocols 

in Immunology, J.E.e,a, Coligan eds. Vol 1 pp. 6.15.1 John Wiley and Sons, Toronto. 1991; 
Measurement of mouse and human Interleukin 9 - Ciarletta, A., Giannotti, J., Qark, S.C. 
and Turner, K.J. In Current Protocols in Immunology. J.E.e.a. Coligan eds. Vol 1 pp. 6.13.1, 
John Wiley and Sons, Toronto. 1991, 
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Assays for T-cel\ clone responses to antigens (which will identify, among others, 
proteins that affect APC-T cell interactions as well as direct T<ell effects by measuring 
proliferation and cytokine production) include, without limitation, those described in: 
Current Protocols in Immimology, Ed by J. E. Coligan, A.M. Kruisbeek, D.H. Margulies, 
5 E.M. Shevach, W Strober, Pub. Greene Publishing Associates and Wiley-Intersdence 
(Chapter 3, In Vitro assays for Mouse Lymphocyte Function; Chapter 6, Cytokines and 
their cellular receptors; Chapter 7, Immunologic studies in Humans); Weinberger et aL, 
Proc. Nati. Acad. Sci. USA 77:6091-6095, 1980; Weinberger et al., Eur. J. Immun. 
11:405-411, 1981; Takai et al., J. Immunol. 137:3494-3500, 1986; Takai et aL, J. Immunol. 
10 140:508-512, 1988. 

Immune Stimulating or Suppressing Activity 

A protein of the present invention may also exhibit immune stimulating or 
immune suppressing activity, including without limitation the activities for which assays 

15 are described herein. A protein may be useful in the treatment of various immune 
deficiencies and disorders (including severe combined immunodeficiency (SCID)), e.g., 
in regulating (up or down) growth and proliferation of T and /or B lymphocytes, as well 
as effecting the cytolytic activity of NK cells and other cell popvdations. These immune 
deficiencies may be genetic or be caused by viral (e.g., HIV) as well as bacterial or fungal 

20 infections, or may result from autoimmune disorders. More specifically, infectious 
diseases causes by viral, bacterial, fungal or other infection may be treatable using a 
protein of the present invention, including infections by HIV, hepatitis viruses, 
herpesviruses, mycobacteria, Leishmaxua spp., malaria spp. and various fungal infections 
such as candidiasis. Of course, in this regard, a protein of the present invention may also 

25 be useful where a boost to the immime system generally may be desirable, i.e., in the 
treatment of cancer. 

Autoimmime disorders which may be treated using a protein of the present 
invention include, for example, connective tissue disease, multiple sclerosis, systemic 
lupus erythematosus, rheumatoid arthritis, autoimmime pulmonary inflammation, 
3 0 Guillain-Barre syndrome, autoimmune thyroiditis, insulin dependent diabetes mellitis, 
myasthenia gravis, graft-versus-host disease and autoimmime inflammatory eye disease. 
Such a protein of the present invention may also to be useful in the treatment of allergic 
reactions and conditions, such as asthma (particularly allergic asthma) or otiier respiratory 
problems. Other conditions, in which immune suppression is desired (including, for 



37 



wo 98/49302 



PCT/US98/08336 



example, organ transplantation), may also be treatable using a protein of the present 
invention. 

Using the proteins of the invention it may also be possible to immune responses, 
in a number of ways. Down regulation may be in the form of inhibiting or blocking an 
5 immune response already in progress or may involve preventing the induction of an 
immune response. The functions of activated T cells may be inhibited by suppressing T 
cell responses or by inducing specific tolerance in T cells, or both. Immunosuppression 
of T cell responses is generally an active, non-antigen-specific, process which requires 
continuous exposure of the T cells to the suppressive agent- Tolerance, which involves 
10 inducing non-responsiveness or anergy in T cells, is distinguishable from 
immunosuppression in that it is generally antigen-specific and persists after exposure to 
the tolerizing agent has ceased. Operationally, tolerance can be demonstrated by the lack 
of a T cell response upon reexposure to specific antigen in the absence of the tolerizing 
agent. 

1 5 Down regulating or preventing one or more antigen functions (including without 

limitation B lymphocyte antigen functions (such as , for example, B7)), e.g., preventing 
high level lymphokine synthesis by activated T cells, will be useful in situations of tissue, 
skin and organ transplantation and in graft-versus-host disease (GVHD). For example, 
blockage of T cell function should result in reduced tissue destruction in tissue 

2 0 transplantation. Typically, in tissue transplants, rejection of the transplant is initiated 
through its recognition as foreign by T cells, followed by an immune reaction that destroys 
the transplant. The administration of a molecule which inhibits or blocks interaction of 
a B7 l3rmphocyte antigen with its natural ligand(s) on immune cells (such as a soluble, 
monomeric form of a peptide having B7-2 activity alone or in conjvmction with a 

2 5 monomeric form of a peptide having an activity of another B lymphocyte antigen (e.g., 37- 
1, B7-3) or blocking antibody), prior to transplantation can lead to the binding of the 
molecule to the natural ligand(s) on the immime cells without transmitting the 
corresponding costimulatory signal. Blocking B lymphocyte antigen function in this 
matter prevents cytokine synthesis by immvine cells, such as T cells, and thus acts as an 

30 immxmosuppressant. Moreover, the lack of costimulation may also be sufficient to 
anergize the T cells, thereby inducing tolerance in a subject. Induction of long-term 
tolerance by B lymphocyte antigen-blocking reagents may avoid the necessity of repeated 
administration of these blocking reagents. To achieve sufficient immunosuppression or 
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tolerance in a subject it may also be necessary to block the function of a combination of 
B lymphocyte antigens. 

The efficacy of particular blocking reagents in preventing organ transplant 
rejection or GVHD can be assessed using animal models that are predictive of efficacy in 
5 humans. Examples of appropriate systems which can be used include allogeneic cardiac 
grafts in rats and xenogeneic pancreatic islet cell grafts in mice, both of which have been 
used to examine the immtmosuppressive effects of CTLA4Ig fusion proteins in vivo as 
described in Lenschow et al., Science 257:789-792 (1992) and Turka et al, Proc. Natl. Acad. 
Sci USA, 39:11102-11105 (1992). In addition, murine models of GVHD (see Paul ed., 
10 Fundamental Immunology, Raven Press, New York, 1989, pp. 846-847) can be used to 
determine the effect of blocking B lymphocyte antigen function in vivo on the development 
of that disease. 

Blocking antigen function may also be therapeutically useful for treating 
autoimmune diseases. Many autoimmune disorders are the result of inappropriate 
1 5 activation of T cells that are reactive against self tissue and which promote the production 
of cytokines and autoantibodies involved in the pathology of the diseases. Preventing the 
activation of autoreactive T ceUs may reduce or eliminate disease s)rmptoms. 
Administration of reagents which block costimulation of T cells by disrupting 
receptor:ligand interactions of B lymphocyte antigens can be used to inhibit T cell 

2 0 activation and prevent production of autoantibodies or T cell-derived cytokines which 

may be involved in the disease process. Additionally, blocking reagents may induce 
antigen-spedfic tolerance of autoreactive T cells which could lead to long-term relief from 
the disease. The efficacy of blocking reagents in preventing or alleviating autoimmtme 
disorders can be determined using a ntunber of well-characterized animal models of 
25 human autoimmune diseases. Examples include murine experimental autoimmune 
encephalitis, systemic lupus erythmatosis in MRL/lpr/lpr mice or NZB hybrid mice, 
murine autoimmtme collagen arthritis, diabetes mellitus in NOD mice and BB rats, and 
murine experimental myasthenia gravis (see Paul ed.. Fundamental Immimology, Raven 
Press, New York, 1989, pp. 840-856). 

3 0 Upregulation of an antigen function (preferably a B lymphocyte antigen function), 

as a means of up regulating immime responses, may also be useful in therapy. 
Upregulation of immime responses may be in the form of enhancing an existing immune 
response or eliciting an initial immime response. For example, enhancing an immime 
response through stimulating B lymphocyte antigen function may be useful in cases of 
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viral infection. In addition, systemic viral diseases such as influenza, the common cold, 
and encephalitis might be alleviated by the administration of stimulatory forms of B 
lymphocyte antigens systemically. 

Alternatively, anti-viral immune responses may be enhanced in an infected patient 
5 by removing T cells from the patient, costimulating the T cells in vitro v/iih viral antigen- 
pulsed APCs either expressing a peptide of the present invention or together with a 
stimulatory form of a soluble peptide of the present invention and reintroducing the in 
vitro activated T cells into the patient. Another method of enhancing anti-viral immune 
responses vi^ould be to isolate infected cells from a patieiit, transfect them with a nudeic 

10 acid encoding a protein of the present invention as described herein such that the cells 
express all or a portion of the protein on their surface, and reintroduce the transfected 
cells into the patient. The infected cells would now be capable of delivering a 
costimulatory signal to, and thereby activate, T cells in vivo. 

In another application, up regulation or enhancement of antigen function 

15 (preferably B lymphocyte antigen function) may be useful in the induction of tumor 
immunity. Tiunor cells {e.g., sarcoma, melanoma, lymphoma, leukemia, neuroblastoma, 
carcinoma) transfected with a nucleic add encoding at least one peptide of the present 
invention can be administered to a subject to overcome tumor-specific tolerance in the 
subject. If desired, the tumor cell can be transfected to express a combination of peptides. 

20 For example, tvmnor cells obtained from a patient can be transfected ex vivo with an 
expression vector directing the expression of a peptide having B7-2-like activity alone, or 
in conjunction with a peptide having B7-l-like activity and/or B7-3-like activity. The 
transfected tumor cells are returned to the patient to result in expression of the peptides 
on the surface of the transfected cell. Alternatively, gene therapy techniques can be used 

25 to target a tumor cell for transfection in vivo. 

The presence of the peptide of the present invention having the activity of a B 
lymphocyte antigen(s) on the surface of the tumor cell provides the necessary 
costimulation signal to T cells to induce a T cell mediated immune response against the 
transfected tumor cells. In addition, tumor cells which lack MHC class I or MHC dass II 

3 0 molecules, or which fail to reexpress suffident amounts of MHC class I or MHC class II 
molecules, can be transfected with nucleic add encoding all or a portion of (e.g., a 
cytoplasmic-domain truncated portion) of an MHC class I a chain protein and 
microglobulin protein or an MHC dass n a chain protein and an MHC class n p chain 
protein to thereby express MHC dass I or MHC class H proteins on the cell surface. 
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E>£pression of the appropriate class I or dass n MHC in conjunction with a peptide having 
the activity of a B lymphocyte antigen {e.g., B7-1, B7-2, B7-3) induces a T ceU mediated 
immune response against the transfected tumor cell Optionally, a gene encoding an 
antisense construct which blocks expression of an MHC class n associated protein, such 
5 as the invariant chain, can also be cotransfected with a DNA encoding a peptide having 
the activity of a B lymphocyte antigen to promote presentation of timior associated 
antigens and induce tumor specific immunity. Thus, the induction of a T cell mediated 
immime response in a human subject may be sufficient to overcome tumor-spedfic 
tolerance in the subject. 

1 0 The activity of a protein of the invention may, among other means, be measured 

by the following methods: 

Suitable assays for thymocyte or splenocyte cytotoxicity include, without 
limitation, those described in: Current Protocols in Immimology, Ed by J. E. Coligan, A.M. 
Kruisbeek, D.H. Margulies, E.M. Shevach, W Strober, Pub. Greene Publishing Assodates 

1 5 and Wiley-Intersdence (Chapter 3, In Vih*o assays for Mouse Lymphocyte Function 3.1- 
3.19; Chapter 7, Immunologic studies in Humans); Herrmann et al,, Proc. Nati. Acad. Sd. 
USA 78:2488-2492, 1981; Herrmann et al., J. Immunol. 128:1968-1974, 1982; Handa et al., 
J. Immunol. 135:1564-1572, 1985; Takai et al., J. Immunol. 137:3494-3500, 1986; Takai et aL, 
J. Immunol. 140608-512, 1988; Herrmann et al., Proc. Nati. Acad. Sd. USA 78:2488-2492, 

20 1981; Herrmann et al., J. Immunol. 128:1968-1974, 1982; Handa et al., J. Immunol. 
135:1564-1572, 1985; Takai et al., J. Immunol. 137:3494-3500, 1986; Bowmanet aL, J. 
Virology 61:1992-1998; Takai et al., J. Immunol. 140:508-512, 1988; BertagnolU et al.. 
Cellular Immunology 133:327-341, 1991; Brown et al., J. Immunol. 153:3079-3092, 1994. 
Assays for T-cell-dependent immunoglobulin responses and isotype switching 

25 (which will identify, among others, proteins that modulate T-cell dependent antibody 
responses and that affect Thl/Th2 profiles) indude, witiiout limitation, tiiose described 
in: Maliszewski, J. Immunol. 144:3028-3033, 1990; and Assays for B cell function: In vitro 
antibody production, Mond, J.J. and Bnmswick, M. In Current Protocols in Immunology, 
J.E.e.a. Coligan eds. Vol 1 pp. 3.8.1-3.8.16, John Wiley and Sons, Toronto. 1994. 

30 Mixed lymphocyte reaction (MLR) assays (which will identify, among others, 

proteins that generate predominantiy Thl and CTL responses) indude, without limitation, 
those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A,M. Kruisbeek, 
DH. Margulies, E,M. Shevach, W Strober, Pub. Greene Publishing Assodates and Wiley- 
Intersdence (Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; Chapter 
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7, Immtinologic studies in Humans); Takai et aL, J. Immunol. 137:3494-3500, 1986; Takai 
et al., J. Immunol. 140:508-512, 1988; Bertagnolli et al., J. Immunol. 149:3778-3783, 1992. 

Etendritic cell-dependent assays (which will identify, among others, proteins 
expressed by dendritic cells that activate naive T-cells) include, without limitation, those 
5 described in: Guery et aL, J. Immunol. 134:536-544, 1995; Inaba et al.. Journal of 
Experimental Medicine 173:549-559, 1991; Macatonia et al.. Journal of Immunology 
1545071-5079, 1995; Porgador et al.. Journal of Experimental Medicine 182:255-260, 1995; 
Nair et al.. Journal of Virology 67:4062-4069, 1993; Huang et aL, Science 264:961-965, 
1994; Macatonia et al.. Journal of Experimental Medidne 169:1255-1264, 1989; Bhardwaj 

10 et al., Joximal of Clinical Investigation 94:797-807, 1994; and Inaba et al. Journal of 
Experimental Medicine 172:631-640, 1990. 

Assays for lymphocyte survival/apoptosis (which will identify, among others, 
proteins that prevent apoptosis after superantigen induction and proteins that regulate 
lymphocyte homeostasis) include, without limitation, those described in: Darzynkiewicz 

15 et al.. Cytometry 13:795-808, 1992; Gorczyca et al.. Leukemia 7:659-670, 1993; Gorczyca et 
al.. Cancer Research 53:1945-1951, 1993; Itoh et al.. Cell 66:233-243, 1991; Zacharchuk, 
Journal of Immimology 145:4037-4045, 1990; Zamai et al.. Cytometry 14:891-897, 1993; 
Gorczyca et al.. International Journal of Oncology 1:639-648, 1992. 

Assays for proteins that influence early steps of T-cell commitment and 

20 development include, without limitation, those described in: Antica et al.. Blood 
84:111-117, 1994; Fine et al.. Cellular Immunology 155:111-122, 1994; Galy et al.. Blood 
85:2770-2778, 1995; Toki et al., Proc. Nat. Acad Sci. USA 88:7548-7551, 1991. 

Hematopoiesis Regulating Activity 

25 A protein of the present invention may be useful in regulation of hematopoiesis 

and, consequently, in the treatment of myeloid or lymphoid cell deficiencies. Even 
marginal biological activity in support of colony forming cells or of factor-dependent cell 
lines indicates involvement in regulating hematopoiesis, e.g. in supporting the growth and 
proliferation of erythroid progenitor celk alone or in combination with other cytokines, 

30 thereby indicating utility, for example, in treating various anemias or for use in 
conjimction with irradiation/chemotherapy to stimulate the production of erythroid 
precursors and/ or erythroid cells; in supporting the growth and proliferation of myeloid 
cells such as granulocytes and monocytes/macrophages (i.e., traditional CSF activity) 
useful, for example, in conjunction with chemotherapy to prevent or treat consequent 
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myelo-suppression; in supporting the growth and proliferation of megakaryocytes and 
consequently of platelets thereby allowing prevention or treatment of various platelet 
disorders such as tiirombocytopenia, and generally for use in place of or complimentary 
to platelet transfusions; and /or in supporting the growth and proliferation of 
5 hematopoietic stem cells which are capable of maturing to any and all of the above- 
mentioned hematopoietic cells and therefore find therapeutic utility in various stem cell 
disorders (such as those usually treated with transplantation, including, without 
limitation, aplastic anemia and paroxysmal nocturnal hemoglobinuria), as weU as in 
repopulating the stem cell compartment post irradiation/chemotherapy, either in-vivo or 

10 ex-vivo (i.e., in conjunction with bone marrow transplantation or with peripheral 
progenitor cell transplantation (homologous or heterologous)) as normal cells or 
genetically manipulated for gene therapy. 

The activity of a protein of the invention may, among other means, be measured 
by the following methods: 

1 5 Suitable assays for proliferation and differentiation of various hematop>oietic lines 

are dted above. 

Assays for embryonic stem cell differentiation (which will identify, among others, 
proteins that influence embryoiuc differentiation hematopoiesis) include, without 
limitation, those described in: Johansson et al. Cellular Biology 15:141-151, 1995; Keller et 
20 al.. Molecular and CeUular Biology 13:473-486, 1993; McClanahan et al.. Blood 
81:2903-2915, 1993. 

Assays for stem cell survival and differentiation (which will identify, among 
others, proteins that regulate lympho-hematopoiesis) include, without limitation, those 
described in: Methylcellulose colony forming assays, Freshney, M.G. In Culture of 

2 5 Hematopoietk Cells, R.I. Freshney, et al eds. Vol pp. 265-268, Wiley-Liss, Inc., New York, 

NY. 1994; Hirayama et al., Proc. Natl. Acad. Sd. USA 89:5907-5911, 1992; Primitive 
hematopoietic colony forming cells with high proliferative potential, McNiece, I.K. and 
Briddell, R.A. In Culture of Hematopoietic Cells. R.I. Freshney, et al. eds. Vol pp. 23-39, 
Wiley-Liss, Inc., New York, NY. 1994; Neben et al. Experimental Hematology 22:353-359, 

3 0 1994; Cobblestone area forming cell assay, Ploemacher, R.E, In Culture of Hematopoietic 

Cells, R.L Freshney, et al eds. Vol pp. 1-21, Wiley-Liss, Inc., New York, NY. 1994; Long 
term bone marrow cultures in the presence of stromal cells, Spooncer, E., Dexter, M. and 
Allen, T. In Culture of Hematopoietic Cells. R.I. Freshney, et al eds. Vol pp. 163-179, 
Wiley-Liss, Inc., New York, NY. 1994; Long term culture iiutiating cell assay, Sutheriand, 
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H j. In Culture of Hematopoietic Cells, R.L Freshney, et at. eds. Vol pp. 139-162, Wiley-Uss, 
Inc., New York, NY. 1994. 

Tissue Growth Activity 
5 A protein of the present invention also may have utility in compositions used for 

bone, cartilage, tendon, ligament and /or nerve tissue growth or regeneration, as well as 
for wound healing and tissue repair and replacement, and in the treatment of bums, 
incisions and ulcers. 

A protein of the present invention, which induces cartilage and /or bone growth 
10 in drciraistances where bone is not normally formed, has application in the heding of 
bone fractures and cartilage damage or defects in hiunans and other animals. Such a 
preparation employing a protein of the invention may have prophylactic use in closed as 
well as open fracture reduction and also in the improved fixation of artificial joints. De 
novo bone formation induced by an osteogenic agent contributes to the repair of 
15 congenital, trauma induced, or oncologic resection induced craniofacial defects, and also 
is useful in cosmetic plastic surgery. 

A protein of this invention may also be used in the treatment of periodontal 
disease, and in other tooth repair processes. Such agents may provide an environment 
to attract bone-forming cells, stimulate growth of bone-forming cells or induce 
2 0 differentiation of progenitors of bone-forming cells. A protein of the invention may also 
be useful in the treatment of osteoporosis or osteoarthritis, such as through stimulation 
of bone and /or cartUage repair or by blocking inflammation or processes of tissue 
destruction (coUagenase activity, osteoclast activity, etc.) mediated by inflammatory 
processes. 

25 Another category of tissue regeneration activity that may be attributable to the 

protein of the present invention is tendon/ ligament formation. A protein of the present 
invention, which induces tendon/ligament-like tissue or other tissue formation in 
circumstances where such tissue is not normally formed, has application in the healing of 
tendon or ligament tears, deformities and other tendon or ligament defects in himians and 

30 other animals. Such a preparation employing a tendon /ligament-like tissue inducing 
protein may have prophylactic use in preventing damage to tendon or ligament tissue, as 
well as use in the improved fixation of tendon or ligament to bone or other tissues, and 
in repairing defects to tendon or ligament tissue. De novo tendon /ligament-like tissue 
formation induced by a composition of the present invention contributes to the repair of 
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congenital, trauma induced, or other tendon or ligament defects of other origin, and is also 
useful in cosmetic plastic surgery for attachment or repair of tendons or ligaments. The 
compositions of the present invention may provide an environment to attract tendon- or 
ligament-forming cells, stimulate growth of tendon- or ligament-forming cells, induce 
5 differentiation of progenitors of tendon- or ligament-forming cells, or induce growth of 
tendon/ligament cells or progenitors ex vivo for return in vivo to effect tissue repair. The 
compositions of the invention may also be useful in the treatment of tendinitis, carpal 
timnel syndrome and other tendon or ligament defects. The compositions may also 
include an appropriate matrix and /or sequestering agent as a carrier as is well known in 
10 the art. 

The protein of the present invention may also be useful for proliferation of neural 
cells and for regeneration of nerve and brain tissue, i.e. for the treatment of central and 
peripheral nervous system diseases and neuropathies, as well as mechanical and 
traumatic disorders, which involve degeneration, death or trauma to neural cells or nerve 

15 tissue. More specifically, a protein may be used in the treatment of diseases of the 
peripheral nervous system, such as peripheral nerve injuries, j>eripheral neuropathy and 
localized neuropathies, and central nervous system diseases, such as Alzheimer's, 
Parkinson's disease, Huntington's disease, amyotrophic lateral sclerosis, and Shy-Drager 
syndrome. Further conditions which may be treated in accordance with the present 

2 0 invention include mechanical and traumatic disorders, such as spinal cord disorders, head 
trauma and cerebrovascular diseases such as stroke. Peripheral neuropathies resulting 
from chemotherapy or other medical therapies may also be treatable using a protein of the 
invention- 
Proteins of the invention may also be useful to promote better or faster closure of 

2 5 non-healing wotmds, including without limitation pressure ulcers, ulcers associated with 

vascular insufficiency, surgical and traumatic wounds, and the like. 

It is expected that a protein of the present invention may also exhibit activity for 
generation or regeneration of other tissues, such as organs (including, for example, 
pancreas, liver, intestine, kidney, skin, endothelium), musde (smooth, skeletal or cardiac) 

3 0 and vascular (including vascular endothelium) tissue, or for promoting the growth of cells 

comprising such tissues. Part of the desired effects may be by inhibition or modulation 
of fibrotic scarring to allow normcd tissue to regenerate. A protein of the invention may 
also exhibit angiogenic activity. 
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A protein of the present invention may also be useful for gut protection or 
regeneration and treatment of limg or liver fibrosis, reperfusion injury in various tissues, 
and conditions resulting from systemic cytokine damage. 

A protein of the present invention may also be useful for promoting or inhibiting 
5 di^rentiation of tissues described above from precursor tissues or cells; or for inhibiting 
the growth of tissues described above. 

The activity of a protein of the invention may, among other means, be measured 
by the foUowing methods: 

Assays for tissue generation activity include, without limitation, those described 
10 in: International Patent Publication No. WO95/16035 (bone, cartilage, tendon); 
International Patent Publication No. WO95/05846 (nerve, neuronal); International Patent 
Publication No. WO91/07491 (skin, endotheliimi ). 

Assays for woimd healing activity include, without limitation, those described in: 
Winter, Epidermal Wound Healing, pps. 71-112 (Maibach, HI and Rovee, DT, eds.). Year 
15 Book Medical Publishers, Inc., Chicago, as modified by Eaglstein and Mertz, J. Invest. 
Dermatol 71:382-84 (1978). 

Activin/Inhibin Activity 

A protein of the present invention may also exhibit activin- or inhibin-related 
20 activities. Inhibins are characterized by their ability to inhibit the release of follicle 
stimulating hormone (FSH), while activins and are characterized by their ability to 
stimulate the release of follicle stimulating hormone (FSH). Thus, a protein of the present 
invention, alone or in heterodimers with a member of the inhibin a family, may be useful 
as a contraceptive based on the ability of inhibins to decrease fertility in female mammals 
2 5 and decrease spermatogenesis in male mammals. Administration of sufficient amounts 
of other inhibins can induce infertility in these mammals. Alternatively, the protein of the 
invention, as a homodimer or as a heterodimer with other protein subunits of the inhibin- 
P group, may be useful as a fertility inducing therapeutic, based upon the ability of activin 
molecules in stimulating FSH release from cells of the anterior pituitary. See, for example, 
30 United States Patent 4,798,885. A protein of the invention may also be useful for 
advancement of the onset of fertility in sexually immature mamnuds, so as to increase the 
lifetime reproductive performance of domestic animals such as cows, sheep and pigs. 

The activity of a protein of the invention may, among other means, be measured 
by the following methods: 
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Assays for activin/inhibin activity include, without limitation, those described in: 
Vale et al.. Endocrinology 91:562-572, 1972; Ling et al.. Nature 321:779-782, 1986; Vale et 
al.. Nature 321:776-779, 1986; Mason et al. Nature 318:659-663, 1985; Forage et al., Proc. 
Natl. Acad. Sd. USA 83:3091-3095, 1986. 

5 

Q\emotactic/Chemokinetic Activity 

A protein of the present invention may have chemotactic or chemokinetic activity 
(e.g., act as a chemokine) for mammalian cells, including, for example, monocytes, 
fibroblasts, neutrophils, T-cells, mast cells, eosinophils, epithelial and/or endothelial cells. 
1 0 Chemotactic and chemokinetic proteins can be used to mobilize or attract a desired cell 
population to a desired site of action. Chemotactic or chemokinetic proteins provide 
particular advantages in treatment of woimds and other trauma to tissues, as well as in 
treatment of localized infections. For example, attraction of lymphocytes, monocytes or 
neutrophils to tumors or sites of infection may result in improved immune responses 
1 5 against the tumor or infecting agent. 

A protein or peptide has chemotactic activity for a particular cell population if it 
can stimulate, directly or indirectly, the directed orientation or movement of such cell 
population. Preferably, the protein or peptide has the ability to directly stimulate directed 
movement of cells. Whether a particular protein has chemotactic activity for a population 
20 of cells can be readily determined by employing such protein or peptide in any known 
assay for cell chemotaxis. 

The activity of a protein of the invention may, among other means, be measured 
by the following methods: 

Assays for chemotactic activity (which will identify proteins that induce or prevent 

2 5 chemotaxis) consist of assays tiiat measure the ability of a protein to induce the migration 

of cells across a membrane as well as the ability of a protein to induce the adhesion of one 
cell population to another cell population. Suitable assays for movement and adhesion 
include, without limitation, those described in: Current Protocols in Immunology, Ed by 
J.E. Coligan, A.M. Kruisbeek, D.H. Margulies, E.M. Shevach, W.Strober, Pub. Greene 

3 0 Publishing Associates and Wiley-Intersdence (Chapter 6.12, Measurement of alpha and 

beta Chemokines 6.12.1-6.12.28; Taub et al. J. Clin. Invest 95:1370-1376, 1995; Lind et al. 
APMIS 103:140-146, 1995; Muller et al Eur. J. Immunol. 25: 1744-1748; Gruber et al. J. of 
Immunol. 152:5860-5867, 1994; Johnston et al. J. of Immunol. 153: 1762-1768, 1994. 
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Hemostatic and Thrombolytic Activity 

A protein of the inyention may also exhibit hemostatic or thrombolytic activity. 
As a result such a protein is expected to be useful in treatment of various coagulation 
disorders (including hereditary disorders, such as hemophilias) or to enhance coagulation 
5 and other hemostatic events in treating v^^ounds resulting from trauma, surgery or other 
causes. A protein of the invention may also be useful for dissolving or inhibiting 
formation of thromboses and for treatment and prevention of conditions resulting 
therefrom (such as, for example, infarction of cardiac and central nervous system vessels 
(e.g., stroke). 

The activity of a protein of the invention may, among other means, be measured 
by the following methods: 

Assay for hemostatic and thrombolytic activity include, without limitation, those 
described in: linet et aL, J. Clin. Pharmacol. 26:131-140, 1986; Burdick et al.. Thrombosis 
Res. 45:413-419, 1987; Humphrey et al.. Fibrinolysis 5:71-79 (1991); Schaub, Prostaglandins 
35:467-474, 1988. 

Receptor /Ligand Activity 

A protein of the present invention may also demonstrate activity as receptors, 
receptor ligands or inhibitors or agonists of receptor/ligand interactions. Examples of 
such receptors and ligands include, without limitation, cytokine receptors and their 
ligands, receptor kinases and their ligands, receptor phosphatases and their ligands, 
receptors involved in cell-<:ell interactioi\s and their ligands (including without limitation, 
cellular adhesion molecules (such as selectins, integrins and their ligands) and 
receptor/ligand pairs involved in antigen presentation, antigen recognition and 
development of cellular and humoral immune responses). Receptors and ligands are also 
useful for screening of potential peptide or small molecule inhibitors of the relevant 
receptor/ligand interaction. A protein of the present invention (including, without 
limitation, fragments of receptors and ligands) may themselves be useful as inhibitors of 
receptor/ligand interactions. 

The activity of a protein of the invention may, among other means, be measured 
by the following methods: 

Suitable assays for receptor-ligand activity include without limitation those 
described in:Current Protocols in Immunology, Ed by J.E. Coligan, A.M. Kruisbeek, D.H. 
Margulies, E.M. Shevach, W-Strober, Pub. Greene Publishing Associates and 
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Wiley-Intersdence (Chapter 7.28, Measurement of Cellular Adhesion under static 
conditions 7.28.1-7.28.22), Takai et al., Proc. Natl. Acad. Sd. USA 84:6864-6868, 1987; 
Biereretal., J. Exp. Med. 168:1145-1156, 1988; Rosenstein etal., J. Exp. Med. 169:149-160 
1989; Stoltenborg et al., J. Immunol. Methods 17559-68, 1994; Stitt et al.. Cell 80:661-670, 
5 1995. 

Anti-Inflammatory Activity 

Proteins of the present invention may also exhibit anti-inflammatory activity. The 
anti-inflammatory activity may be achieved by providing a stimulus to cells involved in 

10 the inflammatory response, by inhibiting or promoting cell-<:ell interactions (such as, for 
example, cell adhesion), by inhibiting or promoting chemotaxis of cells involved in the 
inflammatory process, inhibiting or promoting cell extravasation, or by stimulating or 
suppressing production of other factors which more directly inhibit or promote an 
inflammatory response. Proteins exhibiting such activities can be used to treat 

15 inflamjtnatory conditions including chronic or acute conditions), including without 
limitation inflammation associated with infection (such as septic shock, sepsis or systemic 
inflammatory response syndrome (SIRS)), ischemia-reperfusion injury, endotoxin 
lethality, arthritis, complement-mediated hyperacute rejection, nephritis, cytokine or 
chemokine-induced lung injury, inflammatory bowel disease, Crohn's disease or resulting 

2 0 from over production of cytokines such as TNF or IL-1 . Proteins of the invention may also 
be useful to treat anaphylaxis and hypersensitivity to an antigenic substance or material. 

Cadherin /Tumor Invasion Suppressor Activity 

Cadherins are calcium-dependent adhesion molecules that appear to play major 

2 5 roles during development, particularly in defining specific cell types. Loss or alteration 

of normal cadherin expression can lead to changes in cell adhesion properties linked to 
tumor growth and metastasis. Cadherin malfunction is also implicated in other hiunan 
diseases, such as pemphigus vulgaris and pemphigus foliaceus (auto-immune blistering 
skin diseases), Crohn's <lisease, and some developmental abnormalities. 

3 0 The cadherin superfamily includes well over forty members, each with a distinct 

pattern of expression. All members of the superfamily have in common conserved 
extracellular repeats (cadherin domairts), but structural differences are fovmd in other 
parts of the molecule. The cadherin domains bind caldtim to form their tertiary structure 
and thus calcium is required to mediate their adhesion. Only a few amino adds in the 
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first cadherin domain provide the bcisis for homophilic adhesion; modification of this 
recognition site can change the specificity of a cadherin so that instead of recognizing only 
itself, the mutant molecule can now also bind to a different cadherin. In addition, some 
cadherins engage in heterophilic adhesion with other cadherins. 
5 E-cadherin, one member of the cadherin superfamily, is expressed in epithelial cell 

types. Pathologically, if E<adherin expression is lost in a tumor, the malignant cells 
become invasive and the cancer metastasizes. Transfection of cancer ceU lines with 
polynucleotides expressing EH:adherin has reversed cancer-assodated changes by 
returning altered cell shapes to normal, restoring cells' adhesiveness to each other and to 

10 their substrate, decreasing the cell growth rate, and drastically reducing anchorage- 
independent cell growth. Thus, reintroducing E-cadherin expression reverts carcinomas 
to a less advanced stage. It is likely that other cadherins have the same invasion 
suppressor role in carcinomas derived from other tissue types. Therefore, proteins of the 
present invention with cadherin activity, and polynucleotides of the present invention 

15 encoding such proteins, can be used to treat cancer. Introducing such proteins or 
polynucleotides into cancer cells can reduce or eliminate the cancerous changes observed 
in these cells by providing normal cadherin expression. 

Cancer cells have also been shown to express cadherins of a different tissue type 
than their origin, thus allowing these cells to invade and metastasize in a different tissue 

20 in the body. Proteins of the present invention with cadherin activity, and polynucleotides 
of the present invention encoding such proteins, can be substituted in these cells for the 
inappropriately expressed cadherins, restoring normal cell adhesive properties and 
reducing or eliminating the tendency of the cells to metastasize. 

Additionally, proteins of the present invention with cadherin activity, and 

2 5 polynucleotides of the present invention encoding such proteins, can used to generate 

antibodies recognizing and binding to cadherins. Such antibodies can be used to block 
the adhesion of inappropriately expressed tumor-cell cadherins, preventing the cells from 
forming a tumor elsewhere. Such an anti-cadherin antibody can also be used as a marker 
for the grade, pathological type, and prognosis of a cancer, i.e. the more progressed the 

3 0 cancer, the less cadherin expression there will be, and this decrease in cadherin expression 

can be detected by the use of a cadherin-binding antibody. 

Fragments of proteins of the present invention with cadherin activity, preferably 
a polypeptide comprising a decapeptide of the cadherin recognition site, and poly- 
nucleotides of the present invention encoding such protein fragments, can also be used 
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to blcKk cadherm function by binding to cadherins and preventing them from binding in 
ways that produce undesirable effects. Additionally, fragments of proteins of the present 
invention with cadherin activity, preferably truncated soluble cadherin fragments which 
have been found to be stable in the circulation of cancer patients, and polynucleotides 
5 encoding such protein fragments, can be used to disturb proper cell-cell adhesion. 

Assays for cadherin adhesive and invasive suppressor activity include, without 
limitation, those described in: Hortsch et al. J Biol Chem 270 (32): 18809-18817, 1995; 
Miyaki et al. Oncogene 11: 2547-2552, 1995; Ozawa et al. Cell 63: 1033-1038, 1990. 

10 Tumor Inhibition Activity 

In addition to the activities described above for immimological treatment or 
prevention of tumors, a protein of the invention may exhibit other anti-tumor activities. 
A protein may inhibit tumor growth directly or indirectly (such as, for example, via 
ADCC). A protein may exhibit its tumor inhibitory activity by acting on tumor tissue or 

15 tumor precursor tissue, by inhibiting formation of tissues necessary to support tumor 
growth (such as, for example, by inhibiting angiogenesis), by causing production of other 
factors, agents or cell types which inhibit tumor growth, or by suppressing, eliminating 
or inhibiting factors, agents or cell types which promote tumor growth. 

20 Other Activities 

A protein of the invention may also exhibit one or more of the following additional 
activities or effects: inhibiting the growth, infection or function of, or killing, infectious 
agents, including, without limitation, bacteria, viruses, fungi and other parasites; effecting 
(suppressing or enhancing) bodily characteristics, including, without limitation, height, 

2 5 weight, hair color, eye color, skin, fat to lean ratio or other tissue pigmentation, or organ 

or body part size or shape (such as, for example, breast augmentation or diminution, 
change in bone form or shape); effecting biorhythms or caricadic cydes or rhythms; 
effecting the fertility of male or female subjects; effecting the metabolism, catabolism, 
anabolism, processing, utilization, storage or elimiimtion of dietary fat, lipid, protein, 

3 0 carlwhydrate, vitamins, minerals, cofactors or other nutritional factors or component(s); 

effecting behavioral characteristics, including, without limitation, appetite, libido, stress, 
cognition (including cognitive disorders), depression (including depressive disorders) and 
violent behaviors; providing analgesic effects or other pain reducing effects; promoting 
differentiation and growth of embryonic stem cells in lineages other than hematopoietic 
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lineages; hormonal or endocrine activity; in the case of enzymes, correcting deficiencies 
of the enzyme and treating deficiency-related diseases; treatment of hyp>erproliferative 
disorders (such as, for example, psoriasis); immunoglobulin-like activity (such as, for 
example, the ability to bind antigens or complement); and the ability to act as an antigen 
5 in a vaccine composition to raise an immune response against such protein or another 
material or entity which is cross-reactive with such protein. 



ADMINISTRATION AND DOSING 

10 A protein of the present invention (from whatever source derived, including 

without limitation from recombinant and non-recombinant sources) may be used in a 
pharmaceutical composition when combined with a pharmaceutically acceptable carrier. 
Such a composition may also contain (in addition to protein and a carrier) diluents, fillers, 
salts, buffers, stabilizers, solubilizers, and other materials well known in the art. The term 

1 5 "pharmaceutically acceptable" means a non-toxic material that does not interfere with the 
effectiveness of the biological activity of the active ingredient(s). The characteristics of the 
carrier will depend on the route of administration. The pharmaceutical composition of 
the invention may also contain cytokines, lymphokines, or other hematopoietic factors 
such as M-CSF, GM-CSF, TNF, II^l, IL-2, IL-3, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9, IL-10, IL-11, 

2 0 IL-12, IL-13, IH4, IL-15, IFN, TNFO, TNFl, TNF2, G-CSF, Meg-CSF, thrombopoietin, stem 
cell factor, and erythropoietin. The pharmaceutical composition may further contain other 
agents which either enhance the activity of the protein or compliment its activity or use 
in treatment. Such additional factors and/or agents may be included in the 
pharmaceutical composition to produce a synergistic effect with protein of the invention, 

25 or to minimize side effects. Conversely, protein of the present invention may be included 
in formulations of the particular cytokine, Ijonphokine, other hematopoietic factor, 
thrombolytic or anti-thrombotic factor, or anti-inflammatory agent to minimize side effects 
of the cytokine, lymphokine, other hematopoietic factor, thrombolytic or anti-thrombotic 
factor, or anti-inflammatory agent. 

30 A protein of the present invention may be active in multimers (e.g., heterodimers 

or homodimers) or complexes with itself or other proteins. As a result, pharmaceutical 
compositions of the invention may comprise a protein of the invention in such multimeric 
or complexed form. 
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The pharmaceutical composition of the invention may be in the form of a complex 
of the protein(s) of present invention along with protein or peptide antigens. The protein 
and/or peptide antigen will deliver a stimulatory signal to both B and T lymphoqrtes. B 
lymphocytes will respond to antigen through their surface immimoglobulin receptor. T 
5 l)rmphocytes will respond to antigen through the T cell receptor (TCR) following 
presentation of the antigen by MHC proteins. MHC and structurally related proteins 
including those encoded by dass I and class II MHC genes on host cells will serve to 
present the peptide antigen(s) to T lymphocytes. The antigen components could also be 
supplied as purified MHC-peptide complexes alone or with co-stimulatory molecules tiiat 
1 0 can directly signal T cells. Alternatively antibodies able to bind surface immunolgobulin 
and other molecules on B cells as weU as antibodies able to bind the TCR and other 
molecules on T cells can be combined with the pharmaceutical composition of the 
invention. 

The pharmaceutical composition of the invention may be in the form of a liposome 
15 in which protein of the present invention is combined, in addition to other 
pharmaceuticaUy acceptable carriers, with amphipathic agents such as lipids which exist 
in aggregated form as miceDes, insoluble monolayers, liquid crystals, or lamellar layers 
in aqueous solution. Suitable lipids for liposomal formulation include, without limitation, 
monoglycerides, diglycerides, sulfatides, lysoledthin, phospholipids, saponin, bile acids, 
2 0 and the like. Preparation of such liposomal formulations is within the level of skill in the 
art, as disclosed, for example, in VS. Patent No. A;235JS71; U.S. Patent No. 4,501,728; U.S. 
Patent No. 4337^)28; and U.S. Patent No. 4,737,323, all of which are incorporated herein 
by reference. 

As used herein, the term "therapeutically effective iimount" means the total 
2 5 amount of each active component of the pharmaceutical composition or method that is 
sufficient to show a meaningful patient benefit, i.e., treatment, healing, prevention or 
' amelioration of the relevant medical condition, or an increase in rate of treatment, healing, 
prevention or amelioration of such conditions. When applied to an individual active 
ingredient, administered alone, the term refers to that ingredient alone. When applied to 
30 a combination, the term refers to combined amounts of the active ingredients that result 
in the therapeutic effect, whether administered in combination, serially or simultaneously. 

In practicing the method of treatment or use of the present invention, a 
therap>eutically effective amoimt of protein of the present invention is administered to a 
mammal having a condition to be treated. Protein of the present invention may be 
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administered in accordance with the method of the invention either alone or in 
combination with other therapies such as treatments employing cytokines, lymphokines 
or other hematopoietic factors. When co-administered with one or more cytokines, 
lymphokines or other hematopoietic factors, protein of the present invention may be 
5 administered either simultaneously with the cytokine(s), lymphokine(s), other 
hematopoietic factor(s), thrombolytic or anti-thrombotic factors, or sequentially. If 
admiiustered sequentially, the attending physician wiU decide on the appropriate 
sequence of administering protein of the present invention in combination with 
cytokine(s), lymphokine(s), other hematopoietic factor(s), thrombolytic or anti-fhrombotic 
10 factors. 

Administration of protein of the present invention used in the pharmaceutical 
composition or to practice the method of the present invention can be carried out in a 
variety of conventional ways, such as oral ingestion, inhalation, topical application or 
cutaneous, subcutaneous, intraperitoneal, parenteral or intravenous injection. 

1 5 Intravenous administration to the patient is preferred. 

When a therapeutically effective amount of protein of the present invention is 
administered orally, protein of the present invention will be in the form of a tablet, 
capsule, powder, solution or elixir. When administered in tablet form, the pharmaceutical 
composition of the invention may additionally contain a solid carrier such as a gelatin or 

20 an adjuvant. The tablet, capsule, and powder contain from about 5 to 95% protein of the 
present invention, and preferably from about 25 to 90% protein of the present invention. 
When administered in liquid form, a liquid carrier such as water, petroletmi, oils of animal 
or plant origin such as peanut oil, mineral oil, soybean oil, or sesame oil, or synthetic oils 
may be added. The liquid form of the pharmaceutical composition may further contain 

25 physiological saline solution, dextrose or other saccharide solution, or glycols such as 
ethylene glycol, propylene glycol or polyethylene glycol. When administered in liquid 
form, the pharmaceutical composition contains from about 05 to 90% by weight of protein 
of the present invention, and preferably from about 1 to 50% protein of the present 
invention, 

3 0 When a therapeutically effective amount of protein of the present invention is 

administered by intravenous, cutaneous or subcutaneous injection, protein of the present 
invention will be in the form of a p)a-ogen-free, parenterally acceptable aqueous solution. 
The preparation of such parenterally acceptable protein solutions, having due regard to 
pH, isotonidty, stability, and the like, is within the skiU in the art. A preferred 
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pharmaceutical composition for intravenous^ cutaneous, or subcutaneous injection should 
contain, in addition to protein of the present invention, an isotonic vehicle such as Sodium 
Chloride Injection, Ringer's Injection, Dextrose Injection, Dextrose and Sodium Chloride 
Injection, Lactated Ringer's Injection, or other vehicle as known in the art. The 
5 pharmaceutical composition of the present invention may also contain stabilizers, 
preservatives, buffers, antioxidants, or other additives known to those of skill in the art. 

The amount of protein of the present invention in the pharmaceutical composition 
of the present invention will depend upon the nature and severity of the condition being 
treated, and on the nature of prior treatments which the patient has undergone. 

10 Ultimately, the attending physician will decide the amoimt of protein of the present 
invention with which to treat each individual patient. Initially, the attending physician 
will admiruster low doses of protein of the present invention and observe the patient's 
response. Larger doses of protein of the present invention may be administered until the 
optimal therapeutic effect is obtained for the patient, and at that point the dosage is not 

1 5 increased further. It is contemplated that the various pharmaceutical compositions used 
to practice the method of the present invention should contain about 0.01 pg to about 100 
mg (preferably about O.lng to about 10 mg, more preferably about 0.1 ^ig to about 1 mg) 
of protein of the present invention per kg body weight. 

The duration of intravenous therapy using the pharmaceutical composition of the 

2 0 present invention will vary, depending on the severity of the disease being treated and 
the condition and potential idiosyncratic response of each individual patient. It is 
contemplated that the duration of each application of the protein of the present invention 
will be in the range of 12 to 24 hours of continuous intravenous administration. 
Ultimately the attending physician will decide on the appropriate duration of intravenous 

2 5 therapy using the pharmaceutical composition of the present invention. 

Protein of the invention may also be used to immunize animals to obtain 
polyclonal and monoclonal antibodies which specifically react with the protein. Such 
antibodies may be obtained using either the entire protein or fragments thereof as an 
immimogen. The peptide immimogens additionally may contain a cysteine residue at the 

3 0 carboxyl terminus, and are conjugated to a hapten such as keyhole limpet hemocyanin 

(KLH). Methods for synthesizing such peptides are known in the art, for example, as in 
R.P. Merrifield, J. Amer.Chem.Soc. 85, 2149-2154 (1963); J.L. Krstenansky, et al, FEBS Lett. 
211/ 10 (1987). Monoclonal antibodies binding to tiie protein of the invention may be 
useful diagnostic agents for the immunodetection of the protein. Neutralizing monoclonal 
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antibodies binding to the protein may also be useful therapeutics for both conditions 
associated with the protein and also in the treatment of some forms of cancer where 
abnormal expression of the protein is involved. In the case of cancerous cells or leukemic 
cells, neutralizing monoclonal antibodies against the protein may be useful in detecting 
5 and preventing the metastatic spread of the cancerous cells, which may be mediated by 
the protein. 

For compositions of the present invention which are useful for bone, cartilage, 
tendon or ligament regeneration, the therapeutic method includes administering the 
composition topically, systematically, or locally as an implant or device. When 

10 administered, the therapeutic composition for use in this invention is, of course, in a 
pyrogen-free, physiologically acceptable form. Further, the composition may desirably 
be encapsulated or injected in a viscous form for delivery to the site of bone, cartilage or 
tissue damage. Topical administration may be suitable for wound healing and tissue 
repair. Therapeutically useful agents other than a protein of the invention which may also 

15 optionally be included in the composition as described above, may alternatively or 
additionaUy, be administered simultaneously or sequentially with the composition in the 
methods of the invention. Preferably for bone and /or cartilage formation, the 
composition would include a matrix capable of delivering the protein<ontaining 
composition to the site of bone and /or cartilage damage, providing a structure for the 

2 0 developing bone and cartilage and optimally capable of being resorbed into the body. 
Such matrices may be formed of materials presentiy in use for other implanted medical 
applications. 

The choice of matrix material is based on biocompatibility, biodegradability, 
mechanical properties, cosmetic appearance and interface properties. The particular 

25 application of the compositions will define the appropriate formulation. Potential 
matrices for the compositions may be biodegradable and chemically defined calcium 
sulfate, tricaldumphosphate, hydroxyapatite, polylactic add, polyglycoiic add and 
polyanhydrides. Other potential materials are biodegradable and biologically well- 
defined, such as bone or dermcd collagen. Further matrices are comprised of pure proteins 

30 or extracellular matrix components. Other potential matrices are nonbiodegradable and 
chemically defined, such as sintered hydroxapatite, bioglass, alimiinates, or other 
ceramics. Matrices may be comprised of combinations of any of the above mentioned 
types of material, such as polylactic add and hydroxyapatite or coUagen and 
tricaldumphosphate. The bioceramics may be altered in composition, such as in caldum- 
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altiminate-phosphate and processing to alter pore size, particle size, particle shape, and 
biodegradability. 

Presently preferred is a 5050 (mole weight) copolymer of lactic acid and glycolic 
acid in the form of porous particles having diameters ranging from 150 to 800 microns. 
5 In some applications, it will be useful to utilize a sequestering agent, such as 
carboxymethyl cellulose or autologous blood dot, to prevent the protein compositions 
from disassociating from the matrix. 

A preferred family of sequestering agents is ceUulosic materials such as 
alkylcelluloses (including hydroxyalkylcelluloses), including methylcellulose, 

10 ethylcellulose, hydroxyethylcellulose, hydroxypropylcellulose, hydroxypropyl- 
methylcellulose, and carboxymethylcellulose, the most preferred being cationic salts of 
carboxymethylcellulose (CMC). Other preferred sequestering agents include hyaluronic 
acid, sodium alginate, poly(ethylene glycol), polyoxyethylene oxide, carboxyvinyl 
polymer and poly(vinyl alcohol). The amount of sequestering agent useful herein is 05-20 

15 wt%, preferably 1-10 wt% based on total formulation weight, which represents the 
amount necessary to prevent desorbtion of the protein from the polymer matrix and to 
provide appropriate handling of the composition, yet not so much that the progenitor cells 
are prevented from infQtrating the matrix, thereby providing the protein the opportunity 
to assist the osteogenic activity of the progenitor cells. 

20 In further compositions, proteins of the invention may be combined with other 

agents beneficial to the treatment of the bone and /or cartilage defect, wound, or tissue in 
question. These agents include various growth factors such as epidermal growth factor 
(EGF), platelet derived growth factor (PDGF), h-ansforming growth factors (TGF-a and 
TGF-p), and insulin-like growth factor (IGF). 

25 The therapeutic compositions are also presently valuable for veterinary 

applications. Particularly domestic animals and thoroughbred horses, in addition to 
humans, are desired patients for such treatment with proteins of the present invention. 

The dosage regimen of a protein-containing pharmaceutical composition to be 
used in tissue regeneration will be determined by the attending physician considering 

3 0 various factors which modify the action of the proteins, e.g., amoimt of tissue weight 
desired to be formed, the site of damage, the condition of the damaged tissue, the size of 
a wound, type of damaged tissue (e.g., bone), the patient's age, sex, and diet, the severity 
of any infection, time of administration and other clinical factors. The dosage may vary 
with the type of matrix used in the reconstitution and with inclusion of other proteins in 
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the pharmaceutical composition. For example, the' addition of other known growth 
factors, such as IGF I (insulin like growth factor I), to the final composition, may also effect 
the dosage. Progress can be monitored by periodic assessment of tissue /bone growth 
and /or repair, for example. X-rays, histomorphometric determinations and tetracycline 
5 labeling. 

Polynucleotides of the present invention can also be used for gene therapy. Such 
polynucleotides can be introduced either in vivo or ex vivo into cells for expression in a 
mammalian subject. Polynucleotides of the invention may also be administered by other 
known methods for introduction of nucleic acid into a cell or organism (including, without 
1 0 limitation, in the form of viral vectors or naked DNA). 

Cells may also be cultured ex vivo in the presence of proteins of the present 
invention in order to proliferate or to produce a desired effect on or activity in such cells. 
Treated cells can then be introduced in vivo for therapeutic piu*poses. 

15 Patent and literature references cited herein are incorporated by reference as if 

fully set forth. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Jacobs, Kenneth 
McCoy, John M. 
LaVallie, Edward R. 
Racie, Lisa A. 
Me rber g , Da vi d 
Treacy, Maurice 
Spaulding, Vikki 
Agostino, Michael J. 

(ii) TITLE OF INVENTION: SECRETED PROTEINS AND POLYNUCLEOTIDES 
ENCODING THEM 

(iii) NUMBER OF SEQUENCES: 25 

Uv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Genetics Institute, Inc. 

(B) STREET: 87 CambridgePark Drive 

(C) CITY: Cambridge 

(D) STATE: MA 

(E) COUNTRY: U.S.A. 

(F) ZIP: 02140 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Sprunger, Suzcunne A. 

(B) REGISTRATION NUMBER: 41,323 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (617) 498-8284 

(B) TELEFAX: (617) 876-5851 



(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1480 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

AGGCGCCCTC CCTTCCTGAG GAGCTGTTGG CCTGGGTGGG CAGGAACTGC AGTATGGCCA 60 

TGGGCTGAGC AGGCTGAGCA CCTCAGCCTT TAGGGCTTAT GGCCAGGGGA CACTGTATGA 120 

CTCTCCTCTC CTGCAGGTGT CTATCCACCT GGGGTATGGC ATCTACCGAC CTGTCTCCCT 180 

GGGGTCACAT GCTTTGTTTC CATTCTTGTC CTGGCTGGAC CAGCCACTGT GGGACCAACA 240 

CCCCTCCCAC ACTCCCCCAG ACTGCTCGTC TATCACCAGG ATCGCTTTGT ACTTTGTGCA 300 

AAAGGGTCTG GCTGTCCCTT GCTGTTTTCA TCTCTGCCAA GCCTATTGTG CCTCTGGCTG 360 

CTGTATGTGT GCGCGTGCAC GTGTGTGTGT TTCATCTGTT CATTCACTGC ACAAGATATT 420 

TATTGAGTGC CCACTACGTG CCAGGCACTG TTGCTGAGTT CCTGTGGGTG TGTCTCTCGA 480 

TGCCACTCCT GCTTCTCTGG GGGCCTCTTT CTGTGCTTCT CTTTGTCCCC AAATTGCTAC 540 

CTCTTTGTCA GTCTGGGTGT CTCAGGTTCT GTGTGTCCTT GTGTGCATTT CTGTCTCTCT 600 

CTGTCCTCGT CTCTCTGCAA GGCCCTCTAT TTCTCTCTTT CTTGGTGTCT GTCCTTTGCC 660 

CCCTGTGCCC TCTGGATTCT CTGGGTCTAT GTAGGCCCCT GGTCTGCCCT GGGCTCATCA 720 

GCCTTCCTGA CCTCCTCCTG CCCTCCCCTT CACTCCCTCC CTGGCTCTGC CAGTCGGTTC 780 

CCACGGAGCC ATTTTTAGCT CTGATCAGCA TGGGAATGTG CCTCGGCCTC CAAGGGGCTT 840 

TGTCCTGGTG CCCCCGCCCC TGGTCCCAAC CTGATCCCAC GAGGGAGTTG GGACAGGAGG 900 

ATTGATGGTG CTCCCCTTCC TGCCAGCGTC AGAGGCCCTG GAGAGGGGCT GTCCATGGCA 960 

GCTGGTCTTT ATTCCTCCCT CATGAGCACA GGGTCGGGGG GTCCCCATTC TTGGAAGAGG 1020 

TTGAGAAGAC TCCTGGGCTT CAGCCTCTCC CACCCAGCCC TGCCCCTCAC CTGCCTGCCC 1080 

TCCCCTCCCC CACTCTATAC TAGGGACTGG ATCTCAGCCT CTGATCAGTT TCACAAAGTT 1140 

TGTTCCCTAA GGAAATCAAA TCCCATTGTC ACCTAACTCT GAAGATCTAA ATAGCCCTTG 1200 

GATCAGTACG GGAACCCCAA ATCCCACAGG GCCAGATGTG GAGTCTGTGT CTGCCCCCGT 1260 

CTTCTCTCCA TCCTCAAAGC CCCCACTTCT CTCCAGGCTG TTTCTTTTTT TATGACTGTA 1320 

AACATAGATA GTGCTTTATT TTGTTAATAA TAAGATAATG ATGAGTAACT TAACCAGCAC 1380 

ATTTCTCCTG TTTACACTCG GGGGATTTTT TTGTTTTCTG ATGACATAAT AAAGACAGAT 1440 
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CATTTCAGAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1480 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 268 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 

Met Ala Arg Gly His Cys Met Thr Leu Leu Ser Cys Arg Cys Leu Ser 
15 10 15 

Thr Trp Gly Met Ala Ser Thr Asp Leu Ser Pro Trp Gly His Met Leu 
20 25 30 

Cys Phe His Ser Cys Pro Gly Trp Thr Ser His Cys Gly Thr Asn Thr 
35 40 45 

Pro Pro Thr Leu Pro Gin Thr Ala Arg Leu Ser Pro Gly Ser Leu Cys 
50 55 60 

Thr Leu Cys Lys Arg Val Trp Leu Ser Leu Ala Val Phe lie Ser Ala 
65 70 75 80 

Lys Pro lie Val Pro Leu Ala Ala Val Cys Val Arg Val His Val Cys 
85 90 95 

Val Phe His Leu Phe lie His Cys Thr Arg Tyr Leu Leu Ser Ala His 
100 105 110 

Tyr Val Pro Gly Thr Val Ala Glu Phe Leu Trp Val Cys Leu Ser Met 
115 120 125 

Pro Leu Leu Leu Leu Trp Gly Pro Leu Ser Val Leu Leu Phe Val Pro 
130 135 140 

Lys Leu Leu Pro Leu Cys Gin Ser Gly Cys Leu Arg Phe Cys Val Ser 
145 150 155 160 

Leu Cys Ala Phe Leu Ser Leu Ser Val Leu Val Ser Leu Gin Gly Pro 
165 170 175 

Leu Phe Leu Ser Phe Leu Val Ser Val Leu Cys Pro Leu Cys Pro Leu 
180 185 190 

Asp Ser Leu Gly Leu Cys Arg Pro Leu Val Cys Pro Gly Leu lie Ser 
195 200 205 
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' Leu Pro Asp Leu Leu Leu Pro Ser Pro Ser Leu Pro Pro Trp Leu Cys 
210 215 220 

Gin Ser Val Pro Thr Glu Pro Phe Leu Ala Leu lie Ser Met Gly Met 
225 230 235 240 

Cys Leu Gly Leu Gin Gly Ala Leu Ser Trp Cys Pro Arg Pro Trp Ser 
245 250 255 

Gin Pro Asp Pro Thr Arg Glu Leu Gly Gin Glu Asp 
260 265 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1436 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

CCCGGCGGCT CCTGGAACCC CGGTTCGCGG CGATGCCAGC CACCCCAGCG AAGCCGCCGC 60 

AGTTCAGTGC TTGGATAATT TGAAAGTACA ATAGTTGGTT TCCCTGTCCA CCCGCCCCAC 120 

TTCGCTTGCC ATCACAGCAC GCCTATCGGA TGTGAGAGGA GAAGTCCCGC TGCTCGGGCA 180 

CTGTCTATAT ACGCCTAACA CCTACATATA TTTTAAAAAC ATTAAATATA ATTAACAATC 240 

AAAA6AAAGA GGAGAAAGGA AGGGAAGCAT TACTGGGTTA CTATGCACTT GCGACTGATT 300 

TCTTGGCTTT TTATCATTTT GAACTTTATG GAATACATCG GCAGCCAAAA CGCCTCCCGG 360 

GGAAGGCGCC AGCGAAGAAT GCATCCTAAC GTTAGTCAAG GCTGCCAAGG AGGCTGTGCA 420 

ACATGCTCAG ATTACAATGG ATGTTTGTCA TGTAAGCCCA GACTATTTTT TGCTCTGGAA 480 

AGAATTGGCA TGAAGCAGAT TGGAGTATGT CTCTCTTCAT GTCCAAGTGG ATATTATGGA 540 

ACTCGATATC CAGATATAAA TAAGTGTACA AAATGCAAAG CTGACTGTGA TACCTGTTTC 600 

AACAAAAATT TCTGCACAAA ATGTAAAAGT GGATTTTACT TACACCTTGG AAAGTGCCTT 660 

GACAATTGCC CAGAAGGGTT GGAAGCCAAC AACCATACTA TGGAGTGTGT CAGTATTGTG 720 

CACTGTGAGG TCAGTGAATG GAATCCTTGG AGTCCATGCA CGAAGAAGGG AAAAACATGT 780 

GGCTTCAAAA 6AGGGACTGA AACACGGGTC CGAGAAATAA TACAGCATCC TTCAGCAAAG 840 
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GGTAACCTGT GTCCCCCAAC AAATGAGACA AGAAAGTGTA CAGTGCAAAG GAAGAAGTGT 900 

CAGAAGGGAG AACGAGGAAA AAAAGGAAGG GAGAGGAAAA GAAAAAAACC TAATAAAGGA 960 

GAAAGTAAAG AAGCAATACC TGACAGCAAA AGTCTGGAAT CCAGCAAAGA AATCCCAGAG 1020 

CAACGAGAAA ACAAACAGCA GCAGAA6AAG CGAAAAGTCC AAGATAAACA GAAATCGGGG 1080 

ATTGAAGTCA CCCTAGCTGA AGGCCTCACC AGTGTTTCAC AGAGGACACA GCCCACCCCT 1140 

TGCAGGAGGA GGTATCTCTG AGTGTGCAGC ACAGAATCGC ATGACCCACC TTAACCTTCC 1200 

TGTTGTCATG GAAGGATGCA CGGCTGCTCT GTCCACTGTG ATTCCTAGCC CTCTCAAGAT 1260 

CACTGCTTTC TGAAGAATTT GCAATGACTC TGGCTTCTGG CTGCTTATCT CTGGACACCC 1320 

GTTCTCCACC AGTTGTACAG TTCATGTAAT CTACTTGGCT TAATTGATTT TCCACTTCTC 1380 

TCTTCCTCTT CTAAGATATA AACATTTTAA ATGATTTAAA AAAAAAAAAA AAAAAA 1436 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 292 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met His Leu Arg Leu lie Ser Trp Leu Phe lie lie Leu Asn Phe Met 
15 10 15 

Glu Tyr He Gly Ser Gin Asn Ala Ser Arg Gly Arg Arg Gin Arg Arg 
20 25 30 

Met His Pro Asn Val Ser Gin Gly Cys Gin Gly Gly Cys Ala Thr Cys 
35 40 45 

Ser Asp Tyr Asn Gly Cys Leu Ser Cys Lys Pro Arg Leu Phe Phe Ala 
50 55 60 

Leu Glu Arg He Gly Met Lys Gin He Gly Val Cys Leu Ser Ser Cys 
65 70 75 80 

Pro Ser Gly Tyr Tyr Gly Thr Arg Tyr Pro Asp He Asn Lys Cys Thr 
85 90 95 

Lys Cys Lys Ala Asp Cys Asp Thr Cys Phe Asn Lys Asn Phe Cys Thr 
100 105 110 
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Lys Cys Lys Ser Gly Phe Tyr Leu His Leu Gly Lys Cys Leu Asp Asn 
115 120 125 

Cys Pro Glu Gly Leu Glu Ala Asn Asn His Thr Met Glu Cys Val Ser 
130 135 140 

lie Val His Cys Glu Val Ser Glu Trp Asn Pro Trp Ser Pro Cys Thr 
145 150 155 160 

Lys Lys Gly Lys Thr Cys Gly Phe Lys Arg Gly Thr Glu Thr Arg Val 
165 170 175 

Arg Glu lie lie Gin His Pro Ser Ala Lys Gly Asn Leu Cys Pro Pro 
180 185 190 

Thr Asn Glu Thr Arg Lys Cys Thr Val Gin Arg Lys Lys Cys Gin Lys 
195 200 205 

Gly Glu Arg Gly Lys Lys Gly Arg Glu Arg Lys Arg Lys Lys Pro Asn 
210 215 220 

Lys Gly Glu Ser Lys Glu Ala lie Pro Asp Ser Lys Ser Leu Glu Ser 
225 230 235 240 

Ser Lys Glu lie Pro Glu Gin Arg Glu Asn Lys Gin Gin Gin Lys Lys 
.245 250 255 

Arg Lys Val Gin Asp Lys Gin Lys Ser Gly lie Glu Val Thr Leu Ala 
260 265 270 

Glu Gly Leu Thr Ser Val Ser Gin Arg Thr Gin Pro Thr Pro Cys Arg 
275 280 285 

Arg Arg Tyr Leu 
290 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2322 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
GGTTAAGAGC AGATTAGAAC AGAAATCAGG AGAACTTGGG AAGAAGATCA CTGAGTTAAC 60 
ATTGAAAAAT CAGACACTAC AAAAGGAAAT TGAAAAAGTT TATTTGGATA ATAAGCTCCT 120 
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CAAGGAGCAA GCACATAACT TAACAATTGA AATGAAAAAT CATTATGTTC CTTTAAAAGT 180 

AAGTGAAGAC ATGAAAAAGT CACATGATGC AATTATTGAT GATCTTAATA GAAAGCTTTT 240 

AGATGTAACA CAAAAATATA CAGAAAAGAA GTTGGAAATG GAGAAATTGC TACTGGAAAA 300 

TGACAGCTTA AGTAAGGATG TAAGCCGCCT AGAAACTGTG TTTGTACCTC CTGAGAAACA 360 

TGAAAAAGAG ATAATAGCTC TGAAATCCAA TATTGTTGAA CTTAAGAAAC AGCTGTCTGA 420 

ACTTAAGAAA AAATGTGGTG AAGACCAGGA GAAAATACAC GCTCTCACAT CTGAAAACAC 480 

TAACTTGAAG AAGATGATGA GTAATCAGTA TGTGCCAGTT AAAACCCATG AAGAGGTTAA 540 
AATGACACTG AATGACACGT TAGCCAAAAC TAACAGAGAA TTATTAGATG TGAAGAAAAA 600 
ATTTGAAGAT ATAAATCAGG AATTTGTAAA AATAAAAGAT AAGAATGAAA TATTAAAAAG 660 

AAACCTGGAA AACACTCAGA ACCAAATAAA AGCTGAGTAC ATCAGCCTGG CAGAGCACGA 720 
GGCAAAGATG AGCTCGCTAA GTCAGAGCAT GAGAAAGGTG CAGGATAGTA ATGCTGAAAT 780 
CTTGGCCAAC TACAGAAAAG GCCAAGAAGA GATTGTGACA CTGCATGCCG AAATTAAAGC 840 
CCAGAAGAAG GAGCTCGACA CAATACAAGA ATGCATTAAG GTAAAATAT6 CCCCAATTGT 900 
CAGCTTTGAG GAGTGCGAGA GAAAATTTAA AGCAACAGAG T^GAACTAA AAGACCAGTT 960 

ATCAGAGCAG ACACAAAAGT ATAGTGTCAG TGAAGAAGAA GTCAAGAAAA ACAAGCAAGA 1020 

GAATGACAAG TTAAAGAAGG AGATTTTTAC CCTTCAGAAA GATTTGAGAG ATAAGACAGT 1080 

TCTCATTGAG AAGTCTCATG AAATGGAAAG AGCATTAAGC AGAAAAACAG ACGAGCTAAA 1140 

CAAACAGTTA AAAGACTTGT CACAGAAATA CACGGAAGTA AAGAATGTGA AAGAGAAGCT 1200 

AGTAGAAGAA AATGCCAAAC AGACTTCTGA GATACTTGCA GTGCAAAATC TTTTGCAAAA 1260 

ACAACATGTT CCATTGGAAC AGGTTGAGGC TCTGAAAAAA TCTCTTAATG GCACAATTGA 1320 

AAATCTAAAG GAAGAACTGA AGAGTATGCA AAGGTGTTAC GAGAAAGAGC AGCAGACAGT 1380 

GACCAAACTG CATCAATTGT TGGAGAATCA AAAGAACTCT TCTGTACCCC TGGCAGAGCA 1440 

TTTGCAGATT AAAGAAGCAT TTGAGAAAGA AGTTGGAATC ATAAAAGCCA GCTTGAGAGA 1500 

AAAGGAAGAA GAAAGCCAAA ACAAAATGGA AGAAGTCTCC AAACTTCAGT CGGAGGTTCA 1560 

GAATACTAAA CAAGCATTAA AAAAATTAGA GACTAGAGAG GTAGTTGACT TGTCTAAATA 1620 

TAAAGCAACA AAAAGTGATT TGGAGACACA GATTTCTAGC TTAAATGAAA AATTGGCCAA 1680 

TCTGAATAGA AAGTATGAGG AAGTATGTGA GGAAGTTTTG CATGCCAAAA AGAAGGAAAT 1740 

ATCTGCAAAA GATGAGAAGG AATTACTGCA TTTCAGCATT GAGCAAGAAA TTAAGGATCA 1800 
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GAAGGAACGA TCTGATAAGT CCTTAACAAC AATCACAGAG TTACAAAGAA GAATACAAGA 1860 

ATCTGCTAAA CAAATAGAAG CAAAAGATAA TAAGATAACT GAACTGCTTA ATGATGTGGA 1920 

AAGATTAAAA CAGGCACTCA ATGGCCTTTC CCAACTCACC TACACAAGTG GGAACCCCAC 1980 

CAAGAGGCAG AGCCAGCTGA TTGACACTCT GCAGCACCAA GTGAAATCTC TGGAGCAACA 2040 

GCTGGCCGAT GCTGACAGAC AGCACCAAGA AGTAATTGCA ATTTATCGGA CACACCTTCT 2100 

TAGTGCTGCA CAGGGTCACA TGGATGAAGA TGTTCAGGAG GCTCTGCTCC AGATCATACA 2160 

AATGCGGCAG GGGCTTGTGT GCTAGCCGTT AGCACTGACT GCCAGTATCT GTTTTATCTT 2220 

GCTGGTGCTG AACATTCTTT GTGCAACTCC ATGGTCTTTC TGGGCCTTAC TGTGCTGGTA 2280 

TAATTAAAAT AAAATATATT TTGTTCTAAA AAAAAAAAAA AA 2322 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 677 cunino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Lys Asn His Tyr Val Pro Leu Lys Val Ser Glu Asp Met Lys Lys 
15 10 15 

Ser His Asp Ala lie He Asp Asp Leu Asn Arg Lys Leu Leu Asp Val 
20 25 30 



Thr Gin Lys Tyr Thr Glu Lys Lys 
35 40 

Glu Asn Asp Ser Leu Ser Lys Asp 
50 55 

Val Pro Pro Glu Lys His Glu Lys 
65 70 

He Val Glu Leu Lys Lys Gin Leu 
85 

Glu Asp Gin Glu Lys He His Ala 
100 

Lys Lys Met Met Ser Asn Gin Tyr 



Leu Glu Met Glu Lys Leu Leu Leu 
45 

Val Ser Arg Leu Glu Thr Val Phe 
60 

Glu He He Ala Leu Lys Ser Asn 
75 80 

Ser Glu Leu Lys Lys Lys Cys Gly 
90 95 

Leu Thr Ser Glu Asn Thr Asn Leu 
105 110 

Val Pro Val Lys Thr His Glu Glu 
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115 120 125 

Val Lys Met Thr Leu Asn Asp Thr Leu Ala Lys Thr Asn Arg Glu Leu 
130 135 140 

Leu Asp Val Lys Lys Lys Phe Glu Asp He Asn Gin Glu Phe Val Lys 
145 150 155 160 

He Lys Asp Lys Asn Glu He Leu Lys Arg Asn Leu Glu Asn Thr Gin 
165 170 175 

Asn Gin He Lys Ala Glu Tyr He Ser Leu Ala Glu His Glu Ala Lys 
180 185 190 

Met Ser Ser Leu Ser Gin Ser Met Arg Lys Val Gin Asp Ser Asn Ala 
195 200 205 

Glu He Leu Ala Asn Tyr Arg Lys Gly Gin Glu Glu He Val Thr Leu 
210 215 220 

His Ala Glu He Lys Ala Gin Lys Lys Glu Leu Asp Thr He Gin Glu 
225 230 235 240 

Cys He Lys Val Lys Tyr Ala Pro He Val Ser Phe Glu Glu Cys Glu 
245 250 255 

Arg Lys Phe Lys Ala Thr Glu Lys Glu Leu Lys Asp Gin Leu Ser Glu 
260 265 270 

Gin Thr Gin Lys Tyr Ser Val Ser Glu Glu Glu Val Lys Lys Asn Lys 
275 280 285 

Gin Glu Asn Asp Lys Leu Lys Lys Glu He Phe Thr Leu Gin Lys Asp 
290 295 300 

Leu Arg Asp Lys Thr Val Leu He Glu Lys Ser His Glu Met Glu Arg 
305 310 315 320 

Ala Leu Ser Arg Lys Thr Asp Glu Leu Asn Lys Gin Leu Lys Asp Leu 
325 330 335 

Ser Gin Lys Tyr Thr Glu Val Lys Asn Val Lys Glu Lys Leu Val Glu 
340 345 350 

Glu Asn Ala Lys Gin Thr Ser Glu He Leu Ala Val Gin Asn Leu Leu 
355 360 365 

Gin Lys Gin His Val Pro Leu Glu Gin Val Glu Ala Leu Lys Lys Ser 
370 375 380 

Leu Asn Gly Thr He Glu Asn Leu Lys Glu Glu Leu Lys Ser Met Gin 
385 390 395 400 

Arg Cys Tyr Glu Lys Glu Gin Gin Thr Val Thr Lys Leu His Gin Leu 
405 410 415 
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Leu Glu Asn Gin Lys Asn Ser Ser Val Pro Leu Ala Glu His Leu Gin 
420 425 430 

lie Lys Glu Ala Phe Glu Lys Glu Val Gly lie lie Lys Ala Ser Leu 
435 440 445 

Arg Glu Lys Glu Glu Glu Ser Gin Asn Lys Met Glu Glu Val Ser Lys 
450 455 460 

Leu Gin Ser Glu Val Gin Asn Thr Lys Gin Ala Leu Lys Lys Leu Glu 
465 470 475 480 

Thr Arg Glu Val Val Asp Leu Ser Lys Tyr Lys Ala Thr Lys Ser Asp 
485 490 495 

Leu Glu Thr Gin lie Ser Ser Leu Asn Glu Lys Leu Ala Asn Leu Asn 
500 505 510 

Arg Lys Tyr Glu Glu Val Cys Glu Glu Val Leu His Ala Lys Lys Lys 
515 520 525 

Glu He Ser Ala Lys Asp Glu Lys Glu Leu Leu His Phe Ser He Glu 
530 535 540 

Gin Glu He Lys Asp Gin Lys Glu Arg Cys Asp Lys Ser Leu Thr Thr 
545 550 555 560 

He Thr Glu Leu Gin Arg Arg He Gin Glu Ser Ala Lys Gin He Glu 
565 570 575 

Ala Lys Asp Asn Lys He Thr Glu Leu Leu Asn Asp Val Glu Arg Leu 
580 585 590 

Lys Gin Ala Leu Asn Gly Leu Ser Gin Leu Thr Tyr Thr Ser Gly Asn 
595 600 605 

Pro Thr Lys Arg Gin Ser Gin Leu He Asp Thr Leu Gin His Gin Val 
610 615 620 

Lys Ser Leu Glu Gin Gin Leu Ala Asp Ala Asp Arg Gin His Gin Glu 
625 630 635 640 

Val He Ala He Tyr Arg Thr His Leu Leu Ser Ala Ala Gin Gly His 
645 650 655 

Met Asp Glu Asp Val Gin Glu Ala Leu Leu Gin He He Gin Met Arg 
660 665 670 

Gin Gly Leu Val Cys 
675 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2041 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

TCTCCCCCCT CCCCGACACA CACTCACAGG CCGGGCATTG ATGGTAATGT ATGCGAGGAA 60 

ACAGCAGAGA CTCAGTGATG GCTGTCACGA CCGGAGGGGG GACTCGCAGC CTTACCAGGC 120 

ACTTAAGTAT TCATCGAAGA GTCACCCCAG TAGCGGTGAT CACAGACATG AAAAGATGCG 180 

AGACGCCGGA GATCCTTCAC CACCAAATAA AATGTTGCGG AGATCTGATA GTCCTGAAAA 240 

CAAATACAGT GACAGCACAG GTCACAGTAA GGCCAAAAAT GTGCATACTC ACAGAGTTAG 300 

AGAGAGGGAT GGTGGGACCA GTTACTCTCC ACAAGAAAAT TCACACAACC ACAGTGCTCT 360 

TCATAGTTCA AATTCACATT CTTCTAATCC AAGCAATAAC CCAAGCAAAA CTTCAGATGC 420 

ACCTTATGAT TCTGCAGATG ACTGGTCTGA GCATATTAGC TCTTCTGGGA AAAAGTACTA 480 

CTACAATTGT CGAACAGAAG TTTCACAATG GGAAAAACCA AAAGAGTGGC TTGAAAGAGA 540 

ACAGAGACAA AAAGAAGCAA ACAAGATGGC AGTCAACAGC TTCCCAAAAG ATAGGGATTA 600 

CAGAAGAGAG GTGATGCAAG CAACAGCCAC TAGTGGGTTT GCCAGTGGAA AATCTACATC 660 

AGGAGACAAA CCCGTATCAC ATTCTTGCAC AACTCCTTCC ACGTCTTCTG CCTCTGGACT 720 

GAACCCCACA TCTGCACCTC CAACATCTGC TTCAGCGGTC CCTGTTTCTC CTGTTCCACA 780 

GTCGCCAATA CCTCCCTTAC TTCAGGACCC AAATCTTCTT AGACAATTGC TTCCTGCTTT 840 

GCAAGCCACG CTGCAGCTTA ATAATTCTAA TGTGGACATA TCTAAAATAA AT6AAGTTCT 900 

TACAGCAGCT GTGACACAAG CCTCACTGCA GTCTATAATT CATAAGTTTC TTACTGCTGG 960 

ACCATCTGCT TTCAACATAA CGTCTCTGAT TTCTCAAGCT GCTCAGCTCT CTACACAAGC 1020 

CCAGCCATCT AATCAGTCTC CGATGTCTTT AACATCTGAT GCGTCATCCC CAAGATCATA 1080 

TGTTTCTCCA AGAATAAGCA CACCTCAAAC TAACACAGTC CCTATCAAAC CTTTGATCAG 1140 

TACTCCTCCT GTTTCATCAC AGCCAAAGGT TAGTACTCCA GTAGTTAAGC AAGGACCAGT 1200 

GTCACAGTCA GCCACACAGC AGCCTGTAAC TGCTGACAAG CAGCAAGGTC ATGAACCTGT 1260 

CTCTCCTCGA AGTCTTCAGC GCTCAAGCCA GAGAAGTCCA TCACCTGGTC CCAATCATAC 1320 
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TTCTAATAGT AGTAATGCAT CAAATGCAAC AGTTGTACCA CAGAATTCTT CTGCCXIGATC 1380 

CACGTGTTCA TTAACGCCTG CACTAGCAGC ACACTTCAGT GAAAATCTCA TAAAACACGT 1440 

TCAAGGATGG CCTGCAGATC ATGCAGAGAA GCAGGCATCA AGATTACGCG AAGAAGCGCA 1500 

TAACATGGGA ACTATTCACA TGTCCGAAAT TTGTACTGAA TTAAAAAATT TAAGATCTTT 1560 

AGTCCGAGTA TGTGAAATTC AAGCAACTTT GCGAGAGCAA AGGATACTAT TTTTGAGACA 1620 

ACAAATTAAG GAACTTGAAA AGCTAAAAAA TCAGAATTCC TTCATGGTGT GAAGATGTGA 1680 

ATAATTGCAC ATGGTTTTGA GAACAGGAAC TGTAAATCTG TTGCCCAATC TTAACATTTT 1740 

TGAGCTGCAT TTAAGTAGAC TTTGGACCGT TAAGCTGGGC AAAGGAAATG ACAAGGGGAC 1800 

GGGGTCTGTG AGAGTCAATT CAGGG6AAAG ATACAA6ATT GATTTGTAAA ACCCTTGAAA 1860 

TGTAGATTTC TTGTAGATGT ATCCTTCACG TTGTAAATAT GTTTT6TAGA GTGAAGCCAT 1920 

GGGAAGCCAT GTGTAACAGA GCTTAGACAT CCAAAACTAA TCAATGCTGA GGTGGCTAAA 1980 

TACCTAGCCT TTTACATGTA AACCTGTCTG CAAAATTAGC TTTTTTAAAA AAAAAAAAAA 2040 

A 2041 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 187 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNES S : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Arg Gly Asn Ser Arg Asp Ser Val Met Ala Val Thr Thr Gly Gly 
15 10 15 

Gly Thr Arg Ser Leu Thr Arg His Leu Ser lie His Arg Arg Val Thr 
20 25 30 

Pro Val Ala Val lie Thr Asp Met Lys Arg Cys Glu Thr Pro Glu He 
35 40 45 

Leu His His Gin He Lys Cys Cys Gly Asp Leu He Val Leu Lys Thr 
50 55 60 

Asn Thr Val Thr Ala Gin Val Thr Val Arg Pro Lys Met Cys He Leu 
65 70 75 80 
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Thr Glu Leu Glu Arg Gly Met Val Gly Pro Val Thr Leu His Lys Lys 
85 90 95 

He His Thr Thr Thr Val Leu Phe He Val Gin He His He Leu Leu 
100 105 110 

He Gin Ala He Thr Gin Ala Lys Leu Gin Met His Leu Met He Leu 
115 120 125 

Gin Met Thr Gly Leu Ser He Leu Ala Leu Leu Gly Lys Ser Thr Thr 
130 135 140 



Thr He Val Glu Gin Lys Phe His Asn Gly Lys Asn Gin Lys Ser Gly 
145 150 155 160 

Leu Lys Glu Asn Arg Asp Lys Lys Lys Gin Thr Arg Trp Gin Ser Thr 
165 170 175 

Ala Ser Gin Lys He Gly He Thr Glu Glu Arg 
180 185 

(2) INFORMATION FOR SEQ ID NO: 9: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1163 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

GCCCTATCCA CTTAATAGAT GCCAATTCAA AGAGGTTAAA TGATTAGACT AAGGCACCTA 60 

ACTTATGTGA GTGTCAGGCT TCAATGCCTG TGTTAGAGCT ACTCCTTCAC ACAAAATAGT 120 

TCAGAACATA GAGAAGGACC AAGGTTAATA AATGATTTTC ATCCCAAACA CTAAACATGA 180 

TTGATGGGTA GAGGCTGCCC GAAGTACTGT GTAAAGATGG AATCTGAGAT AGAAGAATGC 240 

TGTGGTCAAT TAGTAATTCT TGCCCATGGA GGGATTAGTG ACACATGCCT TGTATATTTG 300 

TCATCTGTGG CCTAAACTCT GCCCCTGAAG GTTTGTTTTC TAATTCAGAG GTTTAAATTA 360 

ATCTAGCCCA CTTAATAAAA CCAGAGATCC TATGGGAAAT TTAGCCTAAG ACAGTGCTGG 420 

AAATTGCCAT ATGTTGATAC AAAGAAGTGT TTGGCCACAT TACAGGTCTC AGACTCAACT 480 

GCTATGTGTG ACTGCCGCTC TGTGCCTATG TCTTGCTTTT TTGCTGAGTT CCCTATTTCC 540 

ATATCTCCAG GTGAATCCAT GAGAAGCGAG AGGGTGGCTG AGAGGCCTGG GCCTCTGGGA 600 
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TTCCACCTTG CTATCTCTGC TCTTCAACCA TTGTTTTAGA CTCTGAACAC CAGATCCTCA 660 

TATCTGAAAG TGATTTGGAG ACCTGGGCAT CAAGTGCTCT TTTAAGAAGG GGCTATCCCA 720 

GAGGACTGTT CAAAAGTCTC ATTCAATAGA GATGTTGGAG TCCCAGAACA AAGTTAGGGA 780 

GCAAACCAGT AACCTATGCT GGTSGTAACA GAGGATCCTA CAATTACGTT TGTTTTTAAG 840 

ACAGGATTTT GCTGTGTTGC CCAGACTGGT CTCAAACTCC TGGGTTCAAG AGATCCATCC 900 

TCCCACCTCA GTCTCCTGAA AGCTGGGATG ACAGGCACAT GCCACCACAC CTAGCTCCTT 960 

ACAACCATTT ATTTTAACTT ATTTCATTTA TAACTGGTAT CTTTCATTTG TATGTGGCAG 1020 

CTAGAGATTT ATATAGGATG GAAGTAATTT ATTTTTAATT TAAATATTTC ATGTTGAACT 1080 

GTTTGCCTTG TATGGAACAT TTTACTTGGC CAATTCAAAT AAAAATAAAG TCAGCTTTGT 1140 

TTGTGACAAA AAAAAAAAAA AAA 1163 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Leu lie Gin Arg Ser Val Trp Pro His Tyr Arg Ser Gin Thr Gin 
15 10 15 

Leu Leu Cys Val Thr Ala Ala Leu Cys Leu Cys Leu Ala Phe Leu Leu 
20 25 30 

Ser Ser Leu Phe Pro Tyr Leu Gin Val Asn Pro 
35 40 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3067 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
{D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUEa^CE DESCRIPTION: SEQ ID NO: 11: 

GCGGTGGCTG AGGCGGCTGG GCCTAGGGTG CAGCGGGCGC GTCTGCGGCT GGTGTTGGCG 60 

CATCTCTAGA TCCTTTCCCG GAGTTCAGTT ATGGGTGTGA GAGGTTTGCA AGGATTTGTG 120 

GGAAGTACCT GCCCACATAT ATGTACAGTA GTAAATTTCA AAGAACTGGC AGAGCACCAC 180 

CGAAGCAAGT ATCCTGGATG TACCCCTACC ATTGTGGTTG ATGCCATGTG TTGTCTCAGA 240 

TATTGGTATA CTCCAGAATC TTGGATCTGC GGTGGCCAGT GGCGAGAATA CTTTTCTGCT 300 

TTGCGAGATT TTGTTAAAAC TTTTACGGCA GCTGGGATCA AGTTGATATT CTTCTTTGAT 360 

GGCATGGTGG AGCAGGATAA GAGAGATGAA TGGGTGAAAC GAAGGCTCAA GAACAACAGG 420 

GAGATATCCA GGATTTTTCA TTACATCAAG TCACACAAGG AGCAGCCAGG CAGAAATATG 480 

TTCTTCATCC CCTCAGGGCT AGCTGTGTTT ACACGATTTG CTCTAAAGAC ACTGGGCCAG 540 

GAAACTTTGT GTTCTTTGCA GGAAGCAGAT TATGAGGTAG CTTCCTATGG CCTCCAGCAT 600 

AACTGTCTTG GGATTCTGGG GGAAGACACT GATTACCTAA TCTATGACAC TTGTCCCTAC 660 

TTTTCAATTA GCGAGCTCTG CCTAGAGAGC CTGGACACCG TCATGCTCTG CAGAGAGAAG 720 

CTCTGTGAGA GTCTGGGCCT CTGTGTGGCC GACCTTCCTC TTCTGGCCTG CCTCCTTGGC 780 

GACGACATAA TCCCAGAGGG CATGTTTGAA AGCTTTAGGT ACAAATGCTT ATCGTCCTAC 840 

ACCTCTGTAA AAGAGAACTT TGACAAAAAA GGTAACATCA TATTAGCTGT GTCAGACCAT 900 

ATATCGAAAG TTCTTTACTT GTATCAAGGT GAGAAAAAAT TAGAAGAGAT ATTACCTCTG 960 

GGACCAAACA AAGCTCTTTT TTATAAAGGA ATGGCATCAT ATCTTTTACC AGGACAAAAA 1020 

TCTCCATGGT TTTTCCAAAA ACCCAAAGGT GTAATAACTT TGGACAAACA AGTAATATCC 1080 

ACGAGTTCAG ACGCCGAATC CAGGGAAGAA GTTCCCATGT GTTCAGATGC TGAATCCAGG 1140 

CAAGAAGTTC CCATGTGTAC AGGCCCTGAA TCCAGGCGAG AAGTTCCCGT GTATACAGAT 1200 

TCTGAACCCA GGCAAGAAGT TCCCATGTGT TCAGACCCTG AACCCAGGCA AGAAGTTCCC 1260 

ACATGTACAG GCCCTGAATC CAGGCGAGAA GTTCCCATGT GTTCAGACCC TGAACCCAGG 1320 

CAAGAAGTTC CCATGTGTAC AGGCCCTGAA GCCAGGCAAG AAGTTCCCAT GTATACAGAC 1380 

TCTGAACCCA GGCAAGAAGT TCCCATGTAT ACAGACTCTG AACCCAGGCA AGAAGTTCCC 1440 

ATGTATACAG GCTCTGAACC CAGGCAAGAA GTTCCCATGT ATACAGGCCC TGAATCCAGG 1500 

CAAGAAGTTC CCATGTATAC AGGCCCTGAA TCCAGGCAAG AAGTTTTAAT ACGGACAGAC 1560 
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CCTGAATCTA GGCAAGAAAT TATGTGTACA GGCCATGAAT CCAAACAGGA AGTTCCCATA 1620 

TGTACAGATC CTATATCCAA GCAAGAAGAC TCCATGTGTA CACACGCT6A AATCAATCAA 1680 

AAATTACCTG TAGCAACAGA TTTTGAATTT AAGCTAGAAG CTCTCATGTG TACAAACCCT 1740 

GAAATTAAAC AAGAAGACCC CACAAATGTG GGGCCTGAAG TAAAGCAACA AGTAACCATG 1800 

GTTTCAGACA CTGAAATCTT AAAGGTTGCT AGAACACATC ACGTCCAAGC AGAAAGCTAC I860 

CTGGTGTACA ACATCATGAG CAGTGGAGAG ATTGAATGCA GCAACACCCT AGAAGATGAG 1920 

CTTGACCAGG CCTTACCCAG CCAGGCCTTC ATTTACCGTC CCATTCGACA GCGGGTCTAC 1980 

TCACTCTTAC TGGAGGACTG TCAAGATGTC ACCAGCACCT GCCTAGCTGT CAAGGAGTGG 2040 

TTTGTGTATC CTGGGAACCC ACTGAGGCAC CCGGACCTCG TCAGGCCGCT GCAGATGACC 2100 

ATTCCAGGGG GAACGCCTAG TTTGAAT^TA TTATGGCTGA ACCAAGAGCC AGAAATACAG 2160 

GTTCGGCGCT TGGACACACT CCTAGCCTGT TTCAATCTTT CCTCCTCAAG AGAAGAGCTG 2220 

CAGGCTGTCG AAAGCCCATT TCAAGCTTTG TGCTGCCTCT TGATCTACCT CTTTGTCCAG 2280 

GTGGACACGC TTTGCCTGGA GGATTTGCAT GCGTTTATTG CGCAGGCCTT GTGCCTCCAA 2340 

G6AAAATCCA CCTCGCAGCT TGTAAATCTA CAGCCTGATT ACATCAACCC CAGAGCCGTG 2400 

CAGCTGGGCT CCCTTCTCGT CCGCGGCCTC ACCACTCTGG TTTTAGTCAA CAGCGCATGT 2460 

GGCTTCCCCT GGAAGACGAG TGATTTCATG CCCTGGAATG TATTTGACGG GAAGCTTTTT 2520 

CATCAGAAGT ACTTGCAATC TGAAAAGGGT TATGCTGTGG AGGTTCTTTT AGAACAAAAT 2580 

GGAGGTGGGG AAGACAGGGC TCCAGCTACC ACAGGACGGG CTCTGGGTAT AGCCGTTCCA 2640 

GTCAGGGACA GCCGTGGAGA GACCAGGGAC CAGGAAGCAG ACAGTATGAG CATGACCAGT 2700 

GGAGAAGGTA CTAGTCAACC TCCAGAAAGA GTATGGAGAG AAAAAGAGGC ACACCTGGAC 2760 

GCAGAGCCCT GCCAGCGCCC TCCTCTGCTG TTGCAGCTGC AAGGAGACCA TGCCTGTGGG 2820 

AGCCAGGCCT CGCTTGCATG AAGAAGGAAC GATGCCTTTT TCAATGGTGT CTCCCTCCCA 2880 

TTGTGCAGAA GAGCTTTTGT TGGCTTCTCT CCCGAGCTTG TGCCTGATTC TGTGGCCCAA 2940 

AACAATCATT GTTAACATCT TCATGTGTTT CATTCTGATC TTTCATTCAT ATATATGATG 3000 

CCTAGCTAAT TTCATTTTAA AATAAATGGG AATCTGTTGT AAAAAAAAAA AAAAAAAAAA 3060 

AAAAAAA 3067 
(2) INFORMATION FOR SEQ ID NO: 12: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 916 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Gly Val Arg Gly Leu Gin Gly Phe Val Gly Ser Thr Cys Pro His 
1 5 10 15 

lie Cys Thr Val Val Asn Phe Lys Glu Leu Ala Glu His His Arg Ser 
20 25 30 

Lys Tyr Pro Gly Cys Thr Pro Thr He Val Val Asp Ala Met Cys Cys 
35 40 45 

Leu Arg Tyr Trp Tyr Thr Pro Glu Ser Trp He Cys Gly Gly Gin Trp 
50 55 60 

Arg Glu Tyr Phe Ser Ala Leu Arg Asp Phe Val Lys Thr Phe Thr Ala 
65 70 75 80 

Ala Gly He Lys Leu He Phe Phe Phe Asp Gly Met Val Glu Gin Asp 
85 90 95 

Lys Arg Asp Glu Trp Val Lys Arg Arg Leu Lys Asn Asn Arg Glu He 
100 105 110 

Ser Arg He Phe His Tyr He Lys Ser His Lys Glu Gin Pro Gly Arg 
115 120 125 

Asn Met Phe Phe He Pro Ser Gly Leu Ala Val Phe Thr Arg Phe Ala 
130 135 140 

Leu Lys Thr Leu Gly Gin Glu Thr Leu Cys Ser Leu Gin Glu Ala Asp 
145 150 155 160 

Tyr Glu Val Ala Ser Tyr Gly Leu Gin His Asn Cys Leu Gly He Leu 
165 170 175 

Gly Glu Asp Thr Asp Tyr Leu He Tyr Asp Thr Cys Pro Tyr Phe Ser 
180 185 190 

He Ser Glu Leu Cys Leu Glu Ser Leu Asp Thr Val Met Leu Cys Arg 
195 200 205 

Glu Lys Leu Cys Glu Ser Leu Gly Leu Cys Val Ala Asp Leu Pro Leu 
210 215 220 

Leu Ala Cys Leu Leu Gly Asp Asp He He Pro Glu Gly Met Phe Glu 



75 



wo 98/49302 



PCTAJS98/08336 



225 230 235 240 

Ser Phe Arg Tyr Lys Cys Leu Ser Ser Tyr Thr Ser Val Lys Glu Asn 
245 250 255 

Phe Asp Lys Lys Gly Asn lie He Leu Ala Val Ser Asp His He Ser 
260 265 270 

Lys Val Leu Tyr Leu Tyr Gin Gly Glu Lys Lys Leu Glu Glu He Leu 
275 280 285 

Pro Leu Gly Pro Asn Lys Ala Leu Phe Tyr Lys Gly Met Ala Ser Tyr 
290 295 300 

Leu Leu Pro Gly Gin Lys Ser Pro Trp Phe Phe Gin Lys Pro Lys Gly 
305 310 315 320 

Val He Thr Leu Asp Lys Gin Val He Ser Thr Ser Ser Asp Ala Glu 
325 330 335 

Ser Arg Glu Glu Val Pro Met Cys Ser Asp Ala. Glu Ser Arg Gin Glu 
340 345 350 

Val Pro Met Cys Thr Gly Pro Glu Ser Arg Arg Glu Val Pro Val Tyr 
355 360 365 

Thr Asp Ser Glu Pro Arg Gin Glu Val Pro Met Cys Ser Asp Pro Glu 
370 375 380 

Pro Arg Gin Glu Val Pro Thr Cys Thr Gly Pro Glu Ser Arg Arg Glu 
385 390 395 400 

Val Pro Met Cys Ser Asp Pro Glu Pro Arg Gin Glu Val Pro Met Cys 
405 410 415 

Thr Gly Pro Glu Ala Arg Gin Glu Val Pro Met Tyr Thr Asp Ser Glu 
420 425 430 

Pro Arg Gin Glu Val Pro Met Tyr Thr Asp Ser Glu Pro Arg Gin Glu 
435 440 445 

Val Pro Met Tyr Thr Gly Ser Glu Pro Arg Gin Glu Val Pro Met Tyr 
450 455 460 

Thr Gly Pro Glu Ser Arg Gin Glu Val Pro Met Tyr Thr Gly Pro Glu 
465 470 475 480 

Ser Arg Gin Glu Val Leu He Arg Thr Asp Pro Glu Ser Arg Gin Glu 
485 490 495 

He Met Cys Thr Gly His Glu Ser Lys Gin Glu Val Pro He Cys Thr 
500 505 510 

Asp Pro He Ser Lys Gin Glu Asp Ser Met Cys Thr His Ala Glu He 
515 520 525 
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■ Asn Gin Lys Leu Pro Val Ala Thr Asp Phe Glu Phe Lys Leu Glu Ala 
530 535 540 

Leu Met Cys Thr Asn Pro Glu lie Lys Gin Glu Asp Pro Thr Asn Val 
545 550 555 560 

Gly Pro Glu Val Lys Gin Gin Val Thr Met Val Ser Asp Thr Glu He 
565 570 575 

Leu Lys Val Ala Arg Thr His His Val Gin Ala Glu Ser Tyr Leu Val 
580 585 590 

Tyr Asn He Met Ser Ser Gly Glu He Glu Cys Ser Asn Thr Leu Glu 
595 600 605 

Asp Glu Leu Asp Gin Ala Leu Pro Ser Gin Ala Phe He Tyr Arg Pro 
610 615 620 

He Arg Gin Arg Val Tyr Ser Leu Leu Leu Glu Asp Cys Gin Asp Val 
625 630 635 640 

Thr Ser Thr Cys Leu Ala Val Lys Glu Trp Phe Val Tyr Pro Gly Asn 
645 650 655 

Pro Leu Arg His Pro Asp Leu Val Arg Pro Leu Gin Met Thr He Pro 
660 665 670 

Gly Gly Thr Pro Ser Leu Lys He Leu Trp Leu Asn Gin Glu Pro Glu 
675 680 685 

He Gin Val Arg Arg Leu Asp Thr Leu Leu Ala Cys Phe Asn Leu Ser 
690 695 700 

Ser Ser Arg Glu Glu Leu Gin Ala Val Glu Ser Pro Phe Gin Ala Leu 
705 710 715 720 

Cys Cys Leu Leu He Tyr Leu Phe Val Gin Val Asp Thr Leu Cys Leu 
725 730 735 

Glu Asp Leu His Ala Phe He Ala Gin Ala Leu Cys Leu Gin Gly Lys 
740 745 750 

Ser Thr Ser Gin Leu Val Asn Leu Gin Pro Asp Tyr He Asn Pro Arg 
755 760 765 

Ala Val Gin Leu Gly Ser Leu Leu Val Arg Gly Leu Thr Thr Leu Val 
770 775 780 

Leu Val Asn Ser Ala Cys Gly Phe Pro Trp Lys Thr Ser Asp Phe Met 
785 790 795 800 

Pro Trp Asn Val Phe Asp Gly Lys Leu Phe His Gin Lys Tyr Leu Gin 
805 810 815 

Ser Glu Lys Gly Tyr Ala Val Glu Val Leu Leu Glu Gin Asn Gly Gly 
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820 825 830 

Gly Glu Asp Arg Ala Pro Ala Thr Thr Gly Arg Ala Leu Gly lie Ala 
835 840 845 

Val Pro Val Arg Asp Ser Arg Gly Glu Thr Arg Asp Gin Glu Ala Asp 
850 855 860 

Ser Met Ser Met Thr Ser Gly Glu Gly Thr Ser Gin Pro Pro Glu Arg 
865 870 875 880 

Val Trp Arg Glu Lys Glu Ala His Leu Asp Ala Glu Pro Cys Gin Arg 
885 890 895 

Pro Pro Leu Leu Leu Gin Leu Gin Gly Asp His Ala Cys Gly Ser Gin 
900 905 910 

Ala Ser Leu Ala 
915 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1914 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

AGCTGTCTGC TCTCCTGGCA GGAATCGCTG AGGGAGGGAA ACGCGGCTCT GAATCAGCCC 60 

AGAACGAGCC TTCGGGAAGC TCACCCTCCG ATCTCGGTGT GATTGTTGTG ATTGTTGTGA 120 

TTTCCTGTCT CGTTTGCCTT GACCGCCATG TGAAAGAATC TGTTCCCCAG CTAGGTGGGG 180 

AAAATTCACA GGTGGGCTGT CTGTAGAGAG AACTGGCTGA TTAAAGGCTT CTCGTCCCGA 240 

TTTTGTGATA GCCAAGTGCT TGGCCTGGTC GACGGTCTTT GCTCCTTTAC AAATAAAGTG 300 

TTCTGTTTCA GTTCGTCCCA AGTTTTCCAT GAAGGGCAGT GGTTCCCTGA CCTCCCAGGT 360 

GCCTGGGCTT CCCCAGGTTC CTGATCTGGG GCTTGGGGCC CTGTGTTTGG GGATCGTGGC 420 

ACTGTGTGCA CCAGCCTGGA AGCACTGGGC CAGTCTTGGC CAAGCTTTCC ATCAGGGATG 480 

ATTTGATCTT GGTGCTACAG GTCTGTGGTA CGACCATTGT TCCACACCAC ATGTCATTAA 540 

TAATGCTTCC CAT6CTTCTG CTTGCAAATG ACCAGCCTTC CAAACAGCCA GAGCTGTTTC 600 
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GAGGTGTTTC TGCAGGCAGG TGCAGGCGTG CCCTCAAATA AGCTTTGCCA ATGGAGTCTC 660 

AGCAAGAGCA AAACCTGGTC AGGAAAGACA AAGCCTGGGA ATCCACCCCC ATGCCCTGCA 720 

GGTTGGCTGG CCCTGGAGCC ATTTATTATA GTGCTAATCA TGTTTCTAGG CAGGTGCAGA 780 

TGGCAAGGGC AGTGTCTTGG TGAGCTTTTT AGCACGAAGA GCCAGGTCTG TCGAAGCCTT 840 

TGTGAGAGCT GGAAACGCAG GTGTGCTGGG CATGCGCAGT ATGGGGTTTC GGGCTCAGGG 900 

CTTGCCCTTT GGCATCAGAC AGACCTGGCT TCGCATCCTG GATTTGCTTC TGACGTGCAC 960 

CCTTCCCTTT GGGTCTCGTG ATGTGAAATG GAGATGTTGT CATTTGTGAG GGCTCCATGA 1020 

AGTTTCGTTG AAATGACAAA TACTAATTTC TTCATCTGTG AAATGGAGAT AATAGTGCTG 1080 

ACCTCAGAAC AGCTGAGAGG ACTAAATGAA ATGATGTTGG ATGTAGCCAT AAAGAACGAA 1140 

GTCAGGCACT GGTGCACGCC TGGAATCCCA GCTCTTGGGA GACCGAGACA GGTGGATTGC 1200 

TTGAGCTCAG GAGTTTGAGA CCAGCCTGAG CAACATAGGG AGGTCCAGTC TCTACAAAAA 1260 

ATATGAAAAG TAGCTGGGCG TGGTGGCGCA TGCCTGTAGT CCCACTACTT GGAAGGCTTC 1320 

GTTGGGAGGA TCACTTGAGC CCAGAAGATT GAGGCTGCAG TAAGCCGTGA TCGTGCCACT 1380 

GCATTCCAGC CTGGGCAACA GAGCGAGACA CTGTCTCAAA TAAAAAAGAT GGGAATAGTA 1440 

GACACTGGGG GCTCCAGAAG GAGGGAGGGA GGGAGGAAGG GGAGGAAGGG CTGAAATGCT 1500 

TTCTATTGGA TACTATCTGG GCATATTACT TCCTGTGGTT CACTGTCTGG GTGACAGGAT 1560 

TCATAGAAGC CCAAACTTTA GCACCACGCA GCATACCCTT GTAACAAAGC CGCACACGTA 1620 

CGCCCTCAAG CTAAAACAAA AGTGGACCGG GAGGCCGAGG TCGGGGGATC ATGAGGTCAG 1680 

GAGTTTGAGA CCAGCCTGGC AGATAACGGT GAAACCCCGT CTCTACTAAA AATACCAAAA 1740 

AAAGTTAGCC GGACATGGTG GCAGGTGCCT GTAGTCCCAG CTACTTGGGA GGCTGGGGCA 1800 

GAAGAATCGC TTGAACCCAG GAGGCGGAGG TTGCAGTGAG CCGAGATTGC GCCACTGCAC 1860 

TCCAGCCTGT GCGACAGAGT GAGACTCCGT CTCAAAAAAA AAAAAAAAAA AAAA 1914 

{2) INFORMATION FOR SEQ ID NO: 14: 

{i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 137 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Thr Ser Leu Pro Asn Ser Gin Ser Cys Phe Glu Val Phe Leu Gin 
1 5 10 15 

Ala Gly Ala Gly Val Pro Ser Asn Lys Leu Cys Gin Trp Ser Leu Ser 
20 25 30 

Lys Ser Lys Thr Trp Ser Gly Lys Thr Lys Pro Gly Asn Pro Pro Pro 
35 40 45 

Cys Pro Ala Gly Trp Leu Ala Leu Glu Pro Phe lie lie Val Leu lie 
50 55 60 



Met Phe Leu Gly Arg Cys Arg Trp Gin Gly Gin Cys Leu Gly Glu Leu 
65 70 75 80 

Phe Ser Thr Lys Ser Gin Val Cys Arg Ser Leu Cys Glu Ser Trp Lys 
85 90 95 

Arg Arg Cys Ala Gly His Ala Gin Tyr Gly Val Ser Gly Ser Gly Leu 
100 105 110 

Ala Leu Trp His Gin Thr Asp Leu Ala Ser His Pro Gly Phe Ala Ser 
115 120 125 



Asp Val His Pro Ser Leu Trp Val Ser 
130 135 

(2) INFORMATION FOR SEQ ID NO: 15: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 575 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

CC6ACTCCCT TCTTTATGGC GTCGCTCCTG TGCTGTGGGC CGAAGCTGGC CGCCTGCGGC 60 

ATCGTCCTCA GCGCCTGGGG AGTGATCATG TTGATAATGC TCGGAATATT TTTCAATGTC 120 

CATTCCGCTG TGTTGATTGA GGACGTTCCC TTCACGGAGA AAGATTTTGA GAATGGCCCC 180 

CAGAACATAT ACAACCTTTA CGAGCAAGTC AGCTACAACT GTTTCATCGC TGCAGGCCTT 240 

TACCTCCTCC TCGGAGGCTT CTCTTTCTGC CAAGTTCGGC TCAATAAGCG CAAGGAATAC 300 
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ATGGTGCGCT AGGGCCCCGG CGCGTTTCCC CGCTCCAGCC CCTCCTCTAT TTAAAGACTC 360 

CCTGCACCGT GTCACCCAGG TCGCGTCCCA CCCTTGCCX5G CGCCCTCTGT GGGACTGGGT 420 

TTCCCGGGCG A6AGACTGAA TCCCTTCTCC CATCTCTGGC ATCCGGCCCC CGTGGAGAGG 480 

GCTGAGGCTG GGGGGCTGTT CCGTCTCTCC ACCCTTCGCT GTGTCCCGTA TCTCAATAAA 540 

GAGAATCTGC TCTCTTCAAA AAAAAAAAAA AAAAA 575 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 98 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Met Ala Ser Leu Leu Cys Cys Gly Pro Lys Leu Ala Ala Cys Gly lie 
15 10 15 

Val Leu Ser Ala Trp Gly Val lie Met Leu lie Met Leu Gly lie Phe 
20 25 30 

Phe Asn Val His Ser Ala Val Leu He Glu Asp Val Pro Phe Thr Glu 
35 40 45 

Lys Asp Phe Glu Asn Gly Pro Gin Asn He Tyr Asn Leu Tyr Glu Gin 
50 55 60 

Val Ser Tyr Asn Cys Phe He Ala Ala Gly Leu Tyr Leu Leu Leu Gly 
65 70 75 80 

Gly Phe Ser Phe Cys Gin Val Arg Leu Asn Lys Arg Lys Glu Tyr Met 
85 90 95 

Val Arg 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



81 



wo 98/49302 



PCT/US98A)8336 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
GNAGCCCAGGA GTCTTCTCAA CCTCTTCC 29 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:18: 
ANCAGTCGCAA GTGCATAGTA ACCCAGTA 29 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
TNCTCAGCTTT TATTTGGTTC TGAGTGTT 29 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide- 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
TNTGCTCAGAC CAGTCATCTG CAGAATCA 29 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
TNCAGCACTGT CTTAGGCTAA ATTTCCCA 29 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
GNATTCGGCGT CTGAACTCGT GGATATTA 29 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
ANATGCCCAGA TAGTATCCAA TAGAAAGC 29 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
CNACAGCACAG GAGCGACGCC ATAAAGAA 29 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 543 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Met Val Met Tyr Ala Arg Lys Gin Gin Arg Leu Ser Asp Gly Cys His 
15 10 15 

Asp Arg Arg Gly Asp Ser Gin Pro Tyr Gin Ala Leu Lys Tyr Ser Ser 
20 25 30 

Lys Ser His Pro Ser Ser Gly Asp His Arg His Glu Lys Met Arg Asp 
35 40 45 

Ala Gly Asp Pro Ser Pro Pro Asn Lys Met Leu Arg Arg Ser Asp Ser 
50 55 60 

Pro Glu Asn Lys Tyr Ser Asp Ser Thr Gly His Ser Lys Ala Lys Asn 
65 70 75 80 

Val His Thr His Arg Val Arg Glu Arg Asp Gly Gly Thr Ser Tyr Ser 
85 90 95 
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' Pro Gin Glu Asn Ser His Asn His Ser Ala Leu His Ser Ser Asn Ser 
100 105 110 

His Ser Ser Asn Pro Ser Asn Asn Pro Ser Lys Thr Ser Asp Ala Pro 
115 120 125 

Tyr Asp Ser Ala Asp Asp Trp Ser Glu His He Ser Ser Ser Gly Lys 
130 135 140 

Lys Tyr Tyr Tyr Asn Cys Arg Thr Glu Val Ser Gin Trp Glu Lys Pro 
145 150 155 160 

Lys Glu Trp Leu Glu Arg Glu Gin Arg Gin Lys Glu Ala Asn Lys Met 
165 170 175 

Ala Val Asn Ser Phe Pro Lys Asp Arg Asp Tyr Arg Arg Glu Val Met 
180 185 190 

Gin Ala Thr Ala Thr Ser Gly Phe Ala Ser Gly Lys Ser Thr Ser Gly 
195 200 205 

Asp Lys Pro Val Ser His Ser Cys Thr Thr Pro Ser Thr Ser Ser Ala 
210 215 220 

Ser Gly Leu Asn Pro Thr Ser Ala Pro Pro Thr Ser Ala Ser Ala Val 
225 230 235 240 

Pro Val Ser Pro Val Pro Gin Ser Pro He Pro Pro Leu Leu Gin Asp 
245 250 255 

Pro Asn Leu Leu Arg Gin Leu Leu Pro Ala Leu Gin Ala Thr Leu Gin 
260 265 270 

Leu Asn Asn Ser Asn Val Asp He Ser Lys He Asn Glu Val Leu Thr 
275 280 285 

Ala Ala Val Thr Gin Ala Ser Leu Gin Ser He He His Lys Phe Leu 
290 295 300 

Thr Ala Gly Pro Ser Ala Phe Asn He Thr Ser Leu He Ser Gin Ala 
305 310 315 320 

Ala Gin Leu Ser Thr Gin Ala Gin Pro Ser Asn Gin Ser Pro Met Ser 
325 330 335 

Leu Thr Ser Asp Ala Ser Ser Pro Arg Ser Tyr Val Ser Pro Arg He 
340 345 350 

Ser Thr Pro Gin Thr Asn Thr Val Pro He Lys Pro Leu He Ser Thr 
355 360 365 

Pro Pro Val Ser Ser Gin Pro Lys Val Ser Thr Pro Val Val Lys Gin 
370 375 380 

Gly Pro Val Ser Gin Ser Ala Thr Gin Gin Pro Val Thr Ala Asp Lys 
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385 390 

Gin Gin Gly His Glu Pro Val Ser 
405 

Gin Arg Ser Pro Ser Pro Gly Pro 
420 

Ala Ser Asn Ala Thr Val Val Pro 
435 440 

Cys Ser Leu Thr Pro Ala Leu Ala 
450 455 

Lys His Val Gin Gly Trp Pro Ala 
465 470 



395 400 

Pro Arg Ser Leu Gin Arg Ser Ser 
410 415 

Asn His Thr Ser Asn Ser Ser Asn 
425 430 

Gin Asn Ser Ser Ala Arg Ser Thr 
445 

Ala His Phe Ser Glu Asn Leu lie 
460 

Asp His Ala Glu Lys Gin Ala Ser 
475 480 



Arg Leu Arg Glu Glu Ala His Asn Met Gly Thr lie His Met Ser Glu 
485 ■ 490 495 

He Cys Thr Glu Leu Lys Asn Leu Arg Ser Leu Val Arg Val Cys Glu 
500 505 510 

He Gin Ala Thr Leu Arg Glu Gin Arg He Leu Phe Leu Arg Gin Gin 
515 520 525 

lie Lys Glu Leu Glu Lys Leu Lys Asn Gin Asn Ser Phe Met Val 
530 535 540 
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What is claimed is: 

1. An isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:l; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:l from nucleotide 99 to nucleotide 902; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:l from nucleotide 162 to nucleotide 902; 

(d) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:l from nucleotide 87 to nucleotide 219; 

(e) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone d25_4 deposited under accession number 
ATCC 98415; 

(f) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone ci25_4 deposited under accession number ATCC 98415; 

(g) a polynucleotide comprising the nucleotide sequence of a mature 
protein coding sequence of clone ci25_4 deposited under accession number ATCC 
98415; 

(h) a polynucleotide encoding a matvu*e protein encoded by the cDNA 
insert of clone ci25_4 deposited under accession number ATCC 98415; 

(i) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:2; 

(j) a polynucleotide encoding a protein comprising a fragment of the 
amino add sequence of SEQ ID NO:2 having biological activity, the fragment 
comprising the amino add sequence from amino add 129 to amino add 138 of 
SEQIDNO:2; 

(k) a polynudeotide which is an allelic variant of a polynucleotide of 
{a)-(h) above; 

(1) a polynudeotide which encodes a spedes homologue of the protein 
of (i) or (j) above ; and 

(m) a polynudeotide that hybridizes under stringent conditions to any 
one of the polynucleotides specified in (a)-(j). 
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2. The polynucleotide of claim 1 wherein said polynucleotide is operably 
linked to at least one expression control sequence. 

3. A host cell transformed with the polynucleotide of claim 2. 

4. The host cell of claim 3, wherein said cell is a mammalian cell. 

5. A process for producing a protein encoded by the polynucleotide of claim 
2, which process comprises: 

(a) growing a culture of the host cell of claim 3 in a suitable culture 
medium; cind 

(b) purifying said protein from the culture, 

6. A protein produced according to the process of claim 5. 

7. The protein of claim 6 comprising a mature protein. 

8. A protein comprising an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO:2; 

(b) fragments of the amino add sequence of SEQ ID NO:2 comprising 
the amino add sequence from amino add 129 to amino add 138 of SEQ ID NO:2; 
and 

(c) the amino add sequence encoded by the cDNA insert of done 
ci25_4 deposited under accession number ATCC 98415; 

the protein being substantially free from other mammalian proteins. 

9. The protein of claim 8, wherein said protein comprises the amino add 
sequence of SEQ ID NO:2. 

10. A composition comprising the protein of daim 8 and a pharmaceutically 
acceptable carrier. 

11. An isolated gene corresponding to the cDNA sequence of SEQ ID NOrl. 
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12. An isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:3; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:3 from nucleotide 283 to nucleotide 1158; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:3 from nucleotide 1 to nucleotide 789; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone da228_6 deposited under accession 
number ATCC 98415; 

(e) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone da228_6 deposited imder accession number ATCC 98415; 

(f) a polynucleotide comprising the nucleotide sequence of a mature 
protein coding sequence of done da228_6 deposited under accession number 
ATCC 98415; 

(g) a polynucleotide encoding a mature protein encoded by the cDNA 
insert of done da228_6 deposited imder accession number ATCC 98415; 

(h) a polynudeotide encoding a protein comprising the amino add 
sequence of SEQ ID NO:4; 

(i) a polynudeotide encoding a protein comprising a fragment of the 
amino add sequence of SEQ ID NO:4 having biological activity, the fragment 
comprising the amino add sequence from amino add 141 to amino add 150 of 
SEQIDNa4; 

(j) a polynudeotide which is an allelic variant of a polynucleotide of 
(aHg) above; 

(k) a polynudeotide which encodes a spedes homologue of the protein 

of (h) or (i) above ; and 

(1) a polynudeotide that hybridizes under stringent conditions to any 
one of the polynudeotides spedfied in (a)-(i). 

13. A protein comprising an amino add sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO:4; 
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(b) the amino add sequence of SEQ ID NO:4 from amino acid 1 to 
amino add 169; 

(c) fragments of the amino add sequence of SEQ ID NO:4 comprising 
the amino add sequence from amino add 141 to amino add 150 of SEQ ID NO:4; 
and 

(d) the amino acid sequence encoded by the cDNA insert of done 
da228_6 deposited under accession number ATCC 98415; 

the protdn being substantially free from other mammalian proteins. 

14. An isolated gene corresponding to the cDNA sequence of SEQ ID NO:3. 

15. An isolated polynudeotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:5; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:5 from nucleotide 152 to nucleotide 2182; 

(c) a polynudeotide comprising the nucleotide sequence of SEQ ID 
NO:5 from nudeotide 2 to nucleotide 931; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone du410_5 deposited under accession 
number ATCC 98415; 

(e) a polynudeotide encoding the full-length protein encoded by the 
cDNA insert of clone du410_5 deposited imder accession number ATCC 98415; 

(f) a polynudeotide comprising the nucleotide sequence of a mature 
protein coding sequence of done du410_5 deposited imder accession number 
ATCC 98415; 

(g) a polynudeotide encoding a mature protein encoded by the cDNA 
insert of done du410_5 deposited under accession number ATCC 98415; 

(h) a pol)mudeotide encoding a protein comprising the amino add 
sequence of SEQ ID NO:6; 

(i) a polynudeotide encoding a protein comprising a fragment of the 
amino add sequence of SEQ ID NO:6 having biological activity, the fragment 
comprising the amino acid sequence from amino add 333 to amino add 342 of 
SEQIDNO:6; 



90 



wo 98/49302 



PCr/US98/08336 



(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(g) above; 

(k) a polynucleotide which encodes a species homologue of the protein 
of (h) or (i) above ; and 

(1) a polynucleotide that hybridizes under stringent conditions to any 
one of the polynucleotides specified in (a)-{i). 

16. A protein comprising an amino add sequence selected from the group 
consisting of: 

(a) the amino add sequence of SEQ ID NC):6; 

(b) the amino add sequence of SEQ ID NO:6 from amino add 1 to 
amino acid 260; 

(c) fragments of the amino add sequence of SEQ ID NO:6 comprising 
the amino add sequence from amino add 333 to amino add 342 of SEQ ID NO:6; 
and 

(d) the amino add sequence encoded by the cDNA insert of done 
du410_5 deposited under accession number ATCC 98415; 

the protein being substantially free from other mammalian proteins. 

17. An isolated gene corresponding to the cDNA sequence of SEQ ID NChS. 

18. An isolated polynucleotide selected from the group consisting of: 

(a) a polynudeotide comprising the nudeotide sequence of SEQ ID 

Na7; 

(b) a polynucleotide comprising the nudeotide sequence of SEQ ID 
NO:7 from nudeotide 51 to nucleotide 611; 

(c) a polynudeotide comprising the nudeotide sequence of SEQ ID 
NO:7 from nudeotide 1 to nucleotide 525; 

(d) a poljmudeotide comprising the nudeotide sequence of the full- 
length protein coding sequence of done eh80_l deposited under accession number 
ATCC 98415; 

(e) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone eh80_l deposited under accession number ATCC 98415; 
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(f) a polynucleotide comprising the nucleotide sequence of a mature 
protein coding sequence of done eh80_l deposited under accession number ATCC 
98415; 

(g) a polynucleotide encoding a mature protein encoded by the cDNA 
insert of clone eh80_l deposited imder accession number ATCC 98415; 

(h) a polynucleotide encoding a protein comprising the amino add 
sequence of SEQ ID NO:8; 

(i) a polynudeotide encoding a protein comprising a fragment of the 
amino add sequence of SEQ ID NO:8 having biological activity, the fragment 
comprising the amino add sequence from amino acid 88 to amino acid 97 of SEQ 
IDNO:8; 

(j) a polynudeotide which is an allelic variant of a polynudeotide of 
(a)-(g) above; 

(k) a polynudeotide which encodes a spedes homologue of the protein 
of (h) or (i) above ; and 

(1) a polynudeotide that hybridizes imder stringent conditions to any 
one of the polynudeotides specified in (a)-(i)- 

19. A protein comprising an amino add sequence selected from the group 
consisting of: 

(a) the amino add sequence of SEQ ID NO:8; 

(b) the amino add sequence of SEQ ID NO:8 from amino add 1 to 
amino acid 158; 

(c) fragments of the amino add sequence of SEQ ID NO:8 comprising 
the amino add sequence from amino add 88 to amino add 97 of SEQ ID NO:8; and 

(d) the amino add sequence encoded by the cDNA insert of clone 
eh80_l deposited under accession number ATCC 98415; 

the protein being substantially free from other mammalian proteins. 

20. An isolated gene corresponding to the cDNA sequence of SEQ ID NO:7. 

21. An isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nudeotide sequence of SEQ ID 

NO:9; 
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(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:9 from nucleotide 431 to nucleotide 559; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:9 from nucleotide 518 to nucleotide 559; 

(d) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:9 from nucleotide 190 to nucleotide 547; 

(e) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone er369_l deposited imder accession 
number ATCC 98415; 

(f) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone er369_l deposited under accession number ATCC 98415; 

(g) a polynucleotide comprising the nucleotide sequence of a mature 
protein coding sequence of clone er369_l deposited under accession number 
ATCC 98415; 

(h) a polynucleotide encoding a mature protein encoded by the cDNA 
insert of done er369_l deposited under accession number ATCC 98415; 

(i) a polynucleotide encoding a protein comprising the amino add 
sequence of SEQ ID NQIO; 

(j) a polynudeotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:10 having biological activity, the fragment 
comprising the amino add sequence from amino add 16 to amino add 25 of SEQ 
IDNOrlO; 

(k) a polynucleotide which is an allelic variant of a polynudeotide of 
(a)-(h) above; 

(1) a polynudeotide which encodes a spedes homologue of the protein 
of (i) or (j) above ; and 

(m) a polynudeotide that hybridizes vmder stringent conditions to any 
one of the pol5mudeotides specified in (a)-0. 

22. A protein comprising an amino add sequence selected from the group 
consisting of: 

(a) the amino add sequence of SEQ ID NO:10; 

(b) the amino add sequence of SEQ ID NO:10 from amino add 1 to 
amino add 39; 
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(c) fragments of the amino add sequence of SEQ ID NO.IO comprising 
the amino acid sequence from amino acid 16 to amino acid 25 of SEQ ID NO.IO; 
and 

(d) the amino add sequence encoded by the cDNA insert of done 
er369_l deposited imder accession niunber ATCC 98415; 

the protein being substantially free from other mammalian proteins. 

23. An isolated gene corresponding to the cDNA sequence of SEQ ID NO:9. 

24. An isolated polynudeotide selected from the group consisting of: 

(a) a polynudeotide comprising the nudeotide sequence of SEQ ID 

NO:ll; 

(b) a polynucleotide comprising the nudeotide sequence of SEQ ID 
NO:ll from nucleotide 91 to nucleotide 2838; 

(c) a polynudeotide comprising the nudeotide sequence of SEQ ID 
NO:ll from nucleotide 2209 to nucleotide 2838; 

(d) a polynudeotide comprising the nucleotide sequence of SEQ ID 
NO:ll from nucleotide 839 to nucleotide 1197; 

(e) a polynucleotide comprising the nudeotide sequence of the full- 
length protein coding sequence of done fhl23_5 deposited imder accession 
number ATCC 98415; 

(f) a polynudeotide encoding the full-length protein encoded by the 
cDNA insert of done fhl23_5 deposited imder accession number ATCC 98415; 

(g) a polynudeotide comprising the nudeotide sequence of a mature 
protein coding sequence of done fhl23_5 deposited under accession number 
ATCC 98415; 

(h) a polynudeotide encoding a matiue protein encoded by the cDNA 
insert of done fhl23_5 deposited under accession ntunber ATCC 98415; 

(i) a pol)mudeotide encoding a protein comprising the amino add 
sequence of SEQ ID Nai2; 

(j) a polynudeotide encoding a protein comprising a fragment of the 
amino add sequence of SEQ ID NO:12 having biological activity, the fragment 
comprising the amino add sequence from amino acid 453 to amino acid 462 of 
SEQIDNO:12; 
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(k) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(h) above; 

(1) a polynucleotide v^rhich encodes a species homologue of the protein 
of (i) or (j) above ; and 

(m) a polynucleotide that hybridizes under stringent conditions to any 
one of the polynucleotides specified in (a)-(j). 

25. A protein comprising an amino add sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO:12; 

(b) the amino add sequence of SEQ ID NO:12 from amino acid 251 to 
amino acid 369; 

(c) fragments of the amino add sequence of SEQ ID NO:12 comprising 
the amino add sequence from amino add 453 to amino add 462 of SEQ ID NO:12; 
and 

(d) the amino add sequence encoded by the cDNA insert of clone 
fhl23_5 deposited imder accession niunber ATCC 98415; 

the protein being substantially free from other mammalian proteins. 

26. An isolated gene corresponding to the cDNA sequence of SEQ ID NOl 1 . 

27. An isolated pol)mudeotide selected from the group consisting of: 

(a) a polynudeotide comprising the nucleotide sequence of SEQ ID 

NO:13; 

(b) a polynudeotide comprising the nudeotide sequence of SEQ ID 
NO:13 from nudeotide 568 to nudeotide 978; 

(c) a polynudeotide comprising the nucleotide sequence of SEQ ID 
NO:13 from nucleotide 1084 to nucleotide 1854; 

(d) a polynudeotide comprising the nudeotide sequence of the full- 
length protein coding sequence of done fm60_l deposited imder accession 
number ATCC 98415; 

(e) a polynudeotide encoding the full-length protein encoded by the 
cDNA insert of done fm60_l deposited under accession ntimber ATCC 98415; 
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(f) a polynucleotide comprising the nucleotide sequence of a mature 
protein coding sequence of clone fm60_l deposited under accession number 
ATCC 98415; 

(g) a polynucleotide encoding a mature protein encoded by the cDNA 
insert of clone fm60_l deposited under accession number ATCC 98415; 

(h) a polynucleotide encoding a protein comprising the amino add 
sequence of SEQ ID NO:14; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:14 having biological activity, the fragment 
comprising the amino add sequence from amino add 63 to amino add 72 of SEQ 
IDNO:14; 

(j) a polynucleotide which is an allelic variant of a polynudeotide of 
(aHg) above; 

(k) a polynudeotide which encodes a spedes homologue of the protein 
of (h) or (i) above ; and 

(1) a polynudeotide that hybridizes under stringent conditions to any 
one of the polynucleotides specified in (a)-(i). 

28. A protein comprising an amino add sequence selected from the group 
consisting of: 

(a) the amino add sequence of SEQ ID NO:14; 

(b) fragments of the amino add sequence of SEQ ID NO:14 comprising 
the amino add sequence from amino add 63 to amino add 72 of SEQ ID NO:14; 
and 

(c) the amino add sequence encoded by the cDNA insert of done 
fm60_l deposited imder accession number ATCC 98415; 

the protein being substantially free from other mammalian proteins. 

29. An isolated gene corresponding to the cDNA sequence of SEQ ID NO:13. 

30. An isolated polynudeotide selected from the group consisting of: 

(a) a polynudeotide comprising the nudeotide sequence of SEQ 

IDNaiS; 
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(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO: 15 from nucleotide 16 to nucleotide 309; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:15 from nucleotide 127 to nucleotide 309; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone fr473_2 deposited under accession 
number ATCC 98415; 

(e) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone fr473_2 deposited under accession number ATCC 98415; 

(f) a polynucleotide comprising the nucleotide sequence of a mature 
protein coding sequence of clone fr473_2 deposited imder accession nimil>er 
ATCC 98415; 

(g) a polynucleotide encoding a mature protein encoded by the cDNA 
ii\sert of clone fr473_2 deposited imder accession numl)er ATCC 98415; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:16; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:16 having biological activity, the fragment 
comprising the amino add sequence from amino acid 44 to amino acid 53 of SEQ 
IDNO:16; 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(aHg) above; 

(k) a polynucleotide which encodes a species homologue of the protein 
of (h) or (i) above ; and 

(1) a polynucleotide that hybridizes tmder stringent conditions to any 
one of the polynucleotides specified in (a)-(i). 

31. A protein comprising an amino add sequence selected from the group 
consisting of: 

(a) the amino add sequence of SEQ ID NO:16; 

(b) the amino add sequence of SEQ ID NO:16 from amino add 1 to 
amino add 58; 
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(c) fragments of the amino add sequence of SEQ ID NO:16 comprising 
the amino acid sequence from amino acid 44 to amino acid 53 of SEQ ID NO:16; 
and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
fr473_2 deposited under accession number ATCC 98415; 

the protein being substantially free from other mammalian proteins. 

32. An isolated gene corresponding to the cDNA sequence of SEQ ID NO:15. 
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FIGURE lA 

Hindlll 1 




BamHI 2772 Clal 2666 



Plasmid name: pED6dpc2 
Plasmid size: 5374 bp 



Comments/References: pED6dpc2 is derived from pED6dpc1 by insertion of a new 
polylinker to facilitate cDNA cloning. SST cDNAs are cloned between EcoRI and Notl. 
pED vectors are described in Kaufman et al.(1991), NAB 19: 4485-4490, 
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FIGURE IB 




Sapl 2135 



Plasmid name: pNOTs 
Plasmid size: 4529 bp 



Comments/Referoncea: pNOTs is a derivative of pMT2 (Kaufman at al,1089. MoLCell.Biol.9: 1741-1750). 
DHFR was delated and a new polylinker was inserted between EcoRI and Hpal. Ml 3 origin 
of replication was inserted in the Ctal site. SST cDNAs are doned between EcoRI and 
NotI 



2/2 



INTERNATIONAL SEARCH REPORT 



nal AppDcatkm No 



PCl/wi 98/08336 



A. CLASSIFICATION OF SUBJECT MATTER , . ^ . , 

IPC 6 C12N15/12 CQ7K14/47 A61K38/17 



Acootding to tntamalional Pntent 



(IPC) or to both 



B. FIELDS SEARCHED 



Mintmum doomnentBtian •earohed (dassiHoalion system followed by otasstftoation syntets) 

IPC 6 C12N C07K A61K 



Ooourrantolionsoaratwd other than niintnrmm dooimientBtnn to thft extern that such ftooumenls am in the ftetds soarehed 



Elaotronio data bos* oonsuled during th* tntomatkmalsaareh (name of data bas* and, where prececal, saarohtarma used) 



C DOCUWENTS CONSIDERED TO BE RELEVANT 



Category- 



RakwanttoobunNo. 



EP a 383 129 A (MOLECULAR DIAGNOSTICS 

INC.) 22 August 1990 

see page 9, line 35-44 

see page 9 - page 10; claims 

see figures 8 » 11, 13 

WO 97 07198 A (GENETICS INSTITUTE INC.) 
27 February 1997 

US 5 536 637 A (JACOBS KENNETH) 
16 July 1996 

cited in the application 



1-11 



□ 



FurthardooufnantaamBatedinlhtt < 



HI 



Patent famity fnafnbafs am Kstad in annat. 



* Spooial oategones of <utad dooumanti : 

-A* doeumantderiwigthagMMral state of the artwhioh is not 

eonsidarad to tw of partietilar mlawanca 
*E* oariardoaumafltbiitpiJblisbedonorafterthe mtamational 



T latardoeumentpiMshad after the intematnnal Ring date 
or priority date and not in oonffal with the appieation but 
oftad to understand thttprinotpte or theory un da i lying the 



•L" doeumentivhieh may thmwdoubte on priority olatnrt(s)or 
which is cted to ttstabfish tha pubtioation date of another 
citation or olhor special reason (as specified) 

■QT document refaning to an oral dbdosum. use, axWbibonor 

itputiSshedpriortotheintematioiud fifing date liut 



later than the priority date etttimed 



-X* document of partioularrBtevanoe; the daimed invwttion 
cannot be canstdend novel or cannot be oonaidared to 
invaiva an inventive Btap when the document is tafcenokme 

nr document of paitioularrelevanoe; the eiaimed invention 

cannot be oonsidarod to invoke an inventive stepwhenthe 
doeuntant is comtiined with one or more other suohifaou- 
mentB,suoh combination being obvious to a person skilled 
htheajt 

*&* doeument member of the same patent family 



Date of the oompletion of theaitemational ee aroh 

8 July 1998 



Dote of maifing of the international searoh report 



0 8. to. 98 



Name and maiflng address of the ISA 

European Patent OfHoe, P.B. 581 8 Potentiaan 2 
NL-2280HVR9swt»( 
TeL (-1-31-70) 340-2040. Tx. 31 651 apo nl, 



Authorizad offiear 



Macchia. G 



INTERNATIONAL SEARCH REPORT 



I. ittonal applioafon No. 

PCT/US 98/08336 



Box I Observations where certain dai ms were found unsearchable (Continuation of Kent 1 of first sheet) 



This International Search Report has not been eslabBshed In re^aect of certain claims under Artide 17(2)Ca) tor the followiRg reasons: 
1.1 I Claims Has.: 

— because they relate to subject matter not rB<;|UtrBd to be searched by this Authority, namely: 



] ClainrisNos.: 

because they relate to parts of the International Application that do not oompfy with the prescribed rec^trements to such 

an extent that no meaningful International Search can be canied out, specifically: 



3. I Claims Nos.: 

— t>eoau3e they are dependent claims and am not drafted in accordance with the second and third sentences of Rule 6.4<a). 



Box tl Observations where unity of invention is iaddng (Continuation of item 2 of first sheet) 



This International Searching Authority found multiple inventions in this international application, as fellows: 

see further Information sheet 



1. I I As all required acfcfitiondsean^h fees were timely paid by the app&cafTt, this International Saanih Report oo^^ 
' ' searchable claims. 

2. [ [ As all searchable claims could be searched without effort pistifying an additional fee, this Authority cfid not invite payntent 

of any additional fee. 

3. I I As only some of the required aJtfitional search fees were timely patd tiy the applicant, this International Search Report 
I — > covers only those claims for which fees were paid, specifically claims Mos.: 

4. I X I No required additional search fees were timely paid by the applicant Consequently, this Intematio^ 

restricted to the invention first mentioned in ttie claims; it is covered by clatms Mos.: 

1-11 



Remaitt on Protest 



□ 
□ 



The ad(£tional search fees were accompanied t>y the opptioanfs protest 
No protest accompanied the payment of addSional search fees. 



International Application No. PCT/ US 98/08336 



FURTHER INFORMATION CONTINUED FROM PCT/ISA/ 210 



This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows: 

1. Claims: 1-11 

Polynucleotide comprising the nucleotide sequence of 
Seq.ID:! and encoding a polypeptide of Seq.ID:2 or 
fragments. Polynucleotide fragments, variants, homologues, 
gene thereof. Host cell transformed with said 
polynucleotide. Protein comprising an amino acid sequence of 
Seq.ID: 2 or fragments thereof. Process for producing said 
protein. 



2. Claims: 12-14 

Polynucleotide comprising the nucleotide sequence of 
Seq.ID: 3 and encoding a polypeptide of Seq.ID: 4 or 
fragments. Polynucleotide fragments, variants, homologues, 
gene thereof. Protein comprising an amino acid sequence of 
Seq.ID: 4 or fragments thereof. 



3. Claims: 15-17 

As invention 2 but concerning Seq.ID: 5 and 6. 



4. Claims: 18-20 

As invention 2 but concerning Seq.ID:? and 8. 



5. Claims: 21-23 

As invention 2 but concerning Seq.I0:9 and 10. 



6. Claims: 24-26 

As invention 2 but concerning Seq.ID: 11 and 12. 



7. Claims: 27-29 

As invention 2 but concerning Seq.ID: 13 and 14. 



8. Claims: 30-32 

As invention 2 but concerning Seq.ID: 15 and 16. 
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